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Preface 


This essay explores some conceptual foundations for understanding the natural 
causes of linguistic systems. At the core of it are three ideas. 

The first is that causal processes in linguistic reality apply in multiple frames 
or “time scales” simultaneously, and we need to understand and address each 
and all of these frames in our work. This is the topic of Chapter 2. 

This leads to the second idea. For language and the rest of culture to exist, its 
constituent parts must have been successfully diffused and kept in circulation 
in the social histories of communities. This relies on convergent processes in 
multiple causal frames, and depends especially on the micro-level behavior of 
people in social interaction. This is the topic of Chapter 3. 

The third idea, building on this, is that the socially-diffusing parts of language 
and culture are not just floating around, but are firmly integrated within larger 
systems. We need to understand the link between the parts and the higher-level 
systems they belong to. This point is underappreciated. Inferences made from 
facts about items are often presented without reflection as being facts about the 
whole systems they fit into. Tree diagrams help to perpetuate this problem. It is 
difficult to assess work on the history of languages if that work does not offer 
a solution to the item/system problem. Facts about items need to be linked to 
facts about systems. We need a causal account of how it is that mobile bits of 
knowledge and behavior become structured cultural systems such as languages. 
This is the topic of Chapter 4 (where the problem is articulated) and Chapter 5 
(where a solution is offered). 

In exploring these ideas, this book suggests a conceptual framework for ex- 
plaining, in causal terms, what language is like and why it is like that. It does not 
attempt to explain specifics, for example why one language has verbal agreement 
involving noun class markers and another language does not. But the basic ele- 
ments of causal frames and transmission biases, and the item/system dynamics 
that arise, are argued to be adequate for ultimately answering specific questions 
like these. Any detailed explanation will work - explicitly or implicitly - in 
these terms. Here is another thing this book does not do: It does not give de- 
tailed or lengthy case studies. Instead, the examples are illustrative, and many 
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can be found in the literature referred to. The Conceptual Foundations of Lan- 
guage Science book series is intended for short and readable studies that address 
and provoke conceptual questions. While methods of research on language keep 
changing, and often provide much-needed drive to a line of work, the underlying 
conceptual work - always independent from the methods being applied - must 
provide the foundation. 


1 Causal units 


What is the causal relationship between the bits of language - sounds, words, 
idioms - and the whole systems that we call languages? A way into this question 
is to ask why any two languages might share a trait. There are four possible 
reasons: 


0. Universal presence: All languages must have the trait; therefore A and B 
have it. 


1. Vertical transmission: The trait was inherited into both A and B from a 
single common ancestor language. 


2. Horizontal transmission: The trait was borrowed into one or both of the 
languages (from A into B, from B into A, or from a third language into 
both A and B). 


3. Internal development: The trait was internally innovated by both A and B, 
independent from each other.! 


Leaving aside universals, the three possibilities (1-3) involve processes that 
are often considered to be qualitatively different, namely (1) inheritance (from 
mother to daughter language), (2) borrowing (from neighbouring language to 
neighbouring language through contact among speakers), and (3) natural, inter- 
nally motivated development. But at a fundamental level these processes are not 
distinct: 


Language change by contact or otherwise is a process of social diffusion. 
The standard analytical distinction between internal and external linguistic 
mechanisms diverts attention from the fact that these are instances of the 
same process: the diffusion of cultural innovation in human populations. 
(Enfield 2005: 197) 


1 Tf the two languages possessed the same starting conditions for the same internal innovation, 
the question arises as to why they shared those starting conditions in the first place. This takes 
us back to the question “Why do two languages share a trait?”. 


1 Causal units 


This is the conclusion I came to when considering possible explanations for 
convergence of structure among neighboring language communities in the main- 
land Southeast Asia area. As I put it then: 


Areal linguistics invites us to revise our understanding of the ontology 
of languages and their historical evolution, showing that the only units 
one needs to posit as playing a causal role are individual speakers and 
individual linguistic items. These unit types are mobile or detachable with 
respect to the populations they inhabit, arguing against essentialism in 
both linguistic and sociocultural systems. 


Areal linguistics presents significant challenges for standard understand- 
ings of the ontology of language from both spatial and temporal perspec- 
tives. Scholars of language need to work through the implications of the 
view that “the language” and “the community” are incoherent as units of 
analysis for causal processes in the historical and areal trajectories of lan- 
guage diffusion and change. (Enfield 2005: 198) 


In this book I explore some implications of these conclusions. When we grap- 
ple with puzzles of inheritance, contact, and diffusion in the history of languages, 
we have to confront the item/system problem (see Chapter 4), and its collateral 
challenges. 

The three processes mentioned above - inheritance, borrowing, innovation — 
can only take place when there is social contact between people, and successful 
diffusion of types of behavior in communities. These are causal preconditions. 
For any of the three processes to succeed, several things have to happen. People 
have to start saying things in new ways (or saying new things), exposing others 
in their personal network to new ideas. Those who are exposed then have to 
copy this new behavior, and they have to be motivated to do so. This in turn 
has to expose more people in their social networks, as well as further exposing 
those who began the process in the first place, validating and encouraging the 
new behavior, and leading it to take further hold. At a fundamental level, the 
three ways that something can get into a language are indistinguishable from 
one another. If there are differences, they have to do with where the idea came 
from, how natural the idea is (i.e. how much it makes sense and perhaps how 
much it helps cut corners in communication or processing), and what is the social 
identificational value of the idea. 


1.1 How we represent language change 


1.1 How we represent language change 


One way to understand something is to look at the history of events that created 
it. Consider the history of any type of life form. The central formative events 
take place in populations. Individuals inherit characteristics - for example, from 
the genome of their parents - and when those inherited characteristics can vary 
between individuals in a population, an individual with one variant might have 
a better chance of surviving than someone with another variant. When higher 
likelihood of survival means higher likelihood of reproduction, this can increase 
the frequency of an advantageous variant in the population. In time, the variant 
comes to be carried by all individuals. Two or more distinct populations emerge, 
and these may then be regarded as separate species. While the new populations 
share a common ancestor, they are now essentially different. 

This way of thinking about the causal basis of species in terms of population dy- 
namics is central in the theory of biological evolution (Darwin 1859; Mayr 1970). 
It can be applied to the evolution of life forms of all kinds, and to cultural types 
including kinship systems, technologies, and languages (Dawkins 1976; Mesoudi 
et al. 2006). The process of speciation in any of these forms of life implies rela- 
tions of common ancestry that may be represented using a tree diagram. Figure 
1.1 illustrates. 


A2 


Al A2a.1 A2a.2 A2b 


Figure 1.1: Tree diagram representing divergence by descent with modification. 
Al, A2a.1, A2a.2, and A2b are common descendants of A. 


Diversification of languages, as in the history of great stocks like Bantu, Aus- 
tronesian, and Indo-European, has long been represented with tree diagrams of 
this kind, in which the ostensible units of analysis are languages. By taking the 
language as the unit of analysis, tree diagrams must assume that languages co- 
here as units. Is this a fair assumption? Are language systems coherent, natural 
kinds? Or do we only imagine them to be? 


1 Causal units 


When tree diagrams are used to represent the history of diversification within 
a family of languages, there is an analogy with the kind of evolution seen in life 
forms that show a total or near-total bias toward vertical transmission in evo- 
lution, namely vertebrates such as primates, birds, fish, and reptiles. So let us 
consider what the tree diagram means in the case of vertebrate natural history. 
Each binary branching in the tree represents a definitive split in a breeding pop- 
ulation. The populations represented by daughter nodes inherit traits that were 
found in the parent population. Members of the daughter populations also com- 
monly inherit modifications of the parent traits that significantly distinguish the 
two daughter populations from each other. Inheritance happens in events of sex- 
ual reproduction, in which complete genotypes are bestowed in the conception 
of new individuals. This encapsulation of the genome in causal events of inher- 
itance ensures the vertical transmission that a tree diagram represents so well. 
In vertebrate species, when two populations are no longer able to interbreed, 
they can no longer contribute to each other’s historical gene pool. This would 
be horizontal transmission, something that is essentially absent from vertebrate 
evolution (though with some caveats; Koonin 2009). The tree representation is 
adequate in the case of vertebrate speciation for one reason: the tree diagram 
does not capture horizontal transmission. The vertebrate genome is essentially 
acquired by the individual organism as a bundle. So the complete organism can 
reasonably be treated as a unit for describing transmission and change in phy- 
logeny. The vehicle for replication is the individual organism as defined by the 
structurally coherent entity that we call the body. 

The problem is that while vertebrates have been implicitly taken to be the 
model for language, they are not like language in causal terms. They are not even 
representative of life forms in general. Most forms of life, including not only the 
non-animal Eukaryotes, but also the Bacteria and Archaea, are not subject to 
strong vertical transmission constraints (Boto 2010). Most forms of life lack the 
bounded body plans that delineate vehicles or interactors for passing on replica- 
ble traits. The overall phenotypic structures of “individuals” in many species are 
to a large degree emergent. Evolutionary processes can be more clearly seen to 
operate on parts of organisms (Dawkins 1976). 


1.2 Linguistic systems 


People find it easy to accept “the language” as a unit of causal analysis. Our 
intuitions suggest that languages are effectively bounded, whole systems. We 
readily think of them as organisms. But they can also be thought of as focussed 


1.2 Linguistic systems 


bundles of items. Indeed they should be thought of in this way, for the “linguistic 
system” is not a natural kind. 

The point has been made for linguistic systems with most clarity and rigor 
by Le Page & Tabouret-Keller (1985). A prerequisite to the idea of a language 
(e.g. English) is the idea of a group of people who speak it. But as Le Page and 
Tabouret-Keller (1985) put it: 


Groups or communities and the linguistic attributes of such groups have 
no existential locus other than in the minds of individuals. (p. 4) We do 
not ourselves then need to put a boundary around any group of speakers 
and say “These are the speakers of Language A, different from Language 
B”, except to the extent that the people think of themselves in that way, 
and identify with or distance themselves from others by their behavior. (p. 
9) 


The point was made a half-century ago for social systems more generally by the 
anthropologist Edmund Leach (1964), in critiquing the structuralism of Radcliffe- 
Brown and students (Fortes & Evans-Pritchard 1940): 


Social systems were spoken of as if they were naturally existing real en- 
tities and the equilibrium inherent in such systems was intrinsic. (p. x) I 
do not consider that social systems are a natural reality. In my view, the 
facts of ethnography and of history can only appear to be ordered in a 
systematic way if we impose upon these facts a figment of thought. (p. xii) 


Fair enough. But there must be some natural reality upon which we may im- 
pose our figments of thought. One candidate is the economy of bits of language 
or culture, each of which has mobility: the words and other things that we can 
borrow from outside, without having to borrow the whole systems they come 
from. As Hudson (1996: 22) puts it: 


We need to distance ourselves somewhat from the concepts represented 
by the words language and dialect, which are a reasonable reflection of 
our lay culture, called “common-sense knowledge”, but not helpful in so- 
ciolinguistics. First, we need a term for the individual “bits of language” 
to which some sociolinguistic statements need to refer, where more global 
statements are not possible. 


Hudson introduces linguistic item as a term for this unit with causal reality. 
Suppose that items — in bundles - are what we impose an essence upon when 
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we imagine languages. Our vernacular language names would be labels for these 
imposed, imagined essences.? 


1.3 Linguistic items 


The idea that languages are causally real units gets weaker when we think ofthe 
mechanisms of language transmission, both across and within generations. There 
are two problems for the language-as-real-system idea. The first is that causal 
processes of transmission can be observed most concretely operating upon items 
(e.g., in the borrowing and learning of words), not on whole systems. The second 
is that horizontal transmission occurs. All parts of a language appear in principle 
to be independently mobile (though of course some bits of language travel more 
freely than others; Thomason & Kaufman 1988; Curnow 2001; Thomason 2001). 
Now consider these points more closely. 

What is transmitted in language history? It is not the whole system at once, but 
the components of the system, piece by piece and chunk by chunk, in millions 
of distinct events. Never all at once but at separate moments, over days, weeks, 
months and years. To be sure, the result of language transmission is a high degree 
of overlap among idiolects in a human population.’ The overlap is so high that 
our idiolects are practically indistinguishable. And this reassures us that systems 
are real wholes. How does this degree of idiolect overlap come about? 

Part of the answer is that speech communities are inward-focused. People ina 
group transmit linguistic items when they converse and interact. This creates an 
economy of signs, in the sense of Zipf (1949). When people in a group interact 
repeatedly, more signs come to be shared among those people. And the more that 
signs are shared, the more readily those people interact. This feedback effect 
in the social circulation of linguistic items is both a result of, and a cause of, 
common ground in a community. People have more common ground because 
they interact more; they interact more because they have more common ground. 
The basic causal units, though, are the shared items, not the systems that emerge. 

The second problem with “the language” as a natural unit is the ease of bor- 
rowing linguistic items. Languages constantly incorporate new structures, and 
quickly. When confronted with this kind of horizontal transmission, students 
of language change have looked for ways to distinguish it from a vertical signal, 


? Tf the reader is concerned that the true holistic system nature of languages is being underesti- 
mated, see Chapter 4, below. 

> On idiolect overlap or convergence, cf. Bakhtin (1981), Hockett (1987: 106-107, 157-158), Lee 
(1996: 227-228). 


1.4 Thinking causally about language change 


usually to then exclude it. But if horizontal transmission is so widespread, this 
should cause people to doubt the value of a model in which vertical transmis- 
sion is the main object of interest for representing and understanding language 
history. With a proper understanding of the causality of language change, we 
see that tree diagrams that take “the language” as the unit of analysis not only 
abstract from reality, they distort it. They are poor conceptual tools for under- 
standing the ontology of language. The solution is to change our assumptions 
about the causal units involved. 

For Darwinian evolution to occur, there must be a population of essentially 
equivalent but non-identical units. These units must inherit traits from compa- 
rable units that existed prior to them. And these inheritable traits must show 
variation that can result in comparable units having different chances of surviv- 
ing to pass on those traits to a new generation. What are the units? In the case 
of vertebrates, a received view is that two sorts of units work together: organ- 
isms, and genes. Organisms are vehicles for replicating genes. In vertebrates, 
the vehicles for inheritance of traits are the bodies of individuals. Each body is 
a phenotypic instantiation of the system. But here is the problem. The situation 
with languages is not like this at all. 


1.4 Thinking causally about language change 


We want a causal account of languages as historically evolved systems. To think 
concretely about this, consider the following. All the conventional bits of lan- 
guage you learned as an infant were created by enormous chains of social inter- 
action in the history of a population. Each link in the chain was an observable 
instance of usage, a micro-scale cycle of transmission, going from public (some- 
one uses a structure when speaking) to private (a second person’s mental state 
is affected when the structure is learnt or entrenched) and back to public (the 
second person uses the structure, exposing someone else), and so on. This may 
seem to be an overly micro-perspective way of putting it. But it is important to 
be explicit about the proximal mechanisms of transmission. Causal statements 
about language often highlight only a part of what is going on. 
Consider (1) and (2): 


1. Knowledge of grammar causes instances of speaking. 


2. Instances of speaking cause knowledge of grammar. 


1 Causal units 


Statement (1) focuses on competence. It points to mechanisms of, and prereq- 
uisites for, saying things. Statement (2) focuses on performance and emphasizes 
its outcomes. We learn about language from what people say. But there is no con- 
tradiction between the statements shown in (1) and (2). They are ways of framing 
the same thing. Competence and performance are equally indispensable in the 
processes of historical evolution that determine and constrain what a language 
can be like. Words are effectively competing for our selection (Croft 2000). If all 
goes well, we select the items that best enable us to manipulate other people’s 
attentional and interpretive resources (Enfield 2013: 16-17). 


1.5 The problem with tree diagrams 


Tree diagrams of language diversification are good for some things, but they are 
not good for representing causal processes of language history, nor the natu- 
ral, causal ontology of languages and language relatedness. The tree diagram 
assumes that we are primarily interested in one form of transmission of herita- 
ble characteristics, namely, vertical transmission of features from a parent to a 
daughter language, normally through first language acquisition in children. The 
alternative - horizontal transmission, i.e., transmission of features between lan- 
guages whose speakers are in contact, normally involving adult language learn- 
ing - is acknowledged but is regarded as noise that needs to be factored out 
from the vertical historical signal of primary interest (cf. Dixon 1997, and note 
that some recent work applying new methods is showing promising signs of a 
shift in direction here; e.g., Reesink et al. 2009). 

The tree diagram is a methodological simplification. It requires us to abstract 
from the causal facts. Of course this abstraction may be a harmless practical ne- 
cessity. But our question is whether the abstraction inherent in the tree diagram 
does conceptual harm. I think the answer is yes. It directs our attention away 
from the causal mechanisms that define language as an evolutionary process, 
and languages as evolved systems. 

To begin to think causally we first need to explore the multiple frames within 
which causal processes may be effected. This is the topic of the next chapter. 


2 Causal frames 


If you really want to understand language, you will have to study a lot of different 
things. Here are some: 


e The finely-timed perceptual, cognitive, and motoric processes involved in 
producing and comprehending language 


e The early lifespan processes by which children learn linguistic and com- 
municative knowledge and skills 


e The evolutionary processes that led to the unique emergence of the cogni- 
tive capacities for language in our species 


e The ways in which the things we say are moves in sequences of social 
actions 


e The mechanisms and products of language change, with links between his- 
torical processes and evolutionary processes 


e Linguistic variation and its role in how historical change in language takes 
place in human populations 


« Things that can be described without reference to process or causation at 
all, as seen in linguistic grammars, dictionaries, ethnographies, and typolo- 
gies, where relationships rather than processes are the focus 


These different points of focus correspond roughly with distinct research per- 
spectives. But they do not merely represent disciplinary alternatives. The dif- 
ferent perspectives can be seen to fit together as parts of a larger conceptual 
framework. 

To give some outline to that framework, I here define six interconnected frames 
for orienting our work. They remind us of the perspectives that are always avail- 
able and potentially relevant, but that we might not be focusing on. They do not 
constitute a definitive set of frames — there is no definitive set — but they are 


2 Causal frames 


useful. They correspond well to the most important causal domains. They conve- 
niently group similar or tightly interconnected sets of causal mechanism under 
single rubrics. And together they cover most of what we need for providing 
answers to our questions in research on language. 

The frames are Microgenetic, Ontogenetic, Phylogenetic, Enchronic, Diachronic, 
and Synchronic. The meanings of these terms are explicated below. As a mnem- 
onic, they spell MOPEDS. Frames like these are sometimes referred to as time 
scales. But calling them “scales” is not accurate. It implies that they all measure 
the same thing, just with arbitrarily different units of measure - seconds ver- 
sus minutes versus hours, etc. But the difference between, say, ontogenetic and 
diachronic (ditto for the other frames) is not defined in terms of abstract or objec- 
tive units of the same underlying stuff — time, in this case. The frames are defined 
and distinguished in terms of different types of underlying processes and causal- 
conditional mechanisms. For each frame, what matters most is how it works, not 
how long it takes. 

By offering a scheme of interrelated causal frames as part of a conceptual 
framework for research on language, I want to stress two points. 

The first is that these frames are most useful when we keep them conceptually 
distinct. Kinds of reasoning that apply within one frame do not necessarily apply 
in another, and data that are relevant in one frame might not be relevant (in the 
same ways) in another. Mixing up these frames leads to confusion. 

The second point is that for a full understanding of the things we study it is not 
enough just to understand these things from within all of the different frames. 
The ideal is also to show how each frame is linked to each other frame, and, 
ultimately, how together the frames reveal a system of causal forces that define 
linguistic reality. 


2.1 Distinct frames and forces 


The ethologist Niko Tinbergen famously emphasized that different kinds of re- 
search question may be posed within different theoretical and methodological 
frames, and may draw on different kinds of data and reasoning (Tinbergen 1963). 
See Table 2.1. 

Tinbergen’s four questions were applied in studying the behavior of non hu- 
man animals. The distinctions were designed to handle communication systems 
such as the mating behavior of stickleback fish, not the far greater complexities 
of language, nor the rich cultural contexts of language systems. If we are going 
to capture the spirit of Tinbergen’s idea, we need a scheme that better covers the 
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Table 2.1: Distinct causal/temporal frames for studying animal behavior, after 


Tinbergen (1963). 
Causal What is the mechanism by which the behavior occurs? 
Functional What is the survival or fitness value of the behavior? 


Phylogenetic How did the behavior emerge in the course of evolution? 
Ontogenetic How does the behavior emerge in an individual’s lifetime? 


phenomena specific to language and its relation to human diversity. 

Many researchers of language and culture have emphasized the need to moni- 
tor and distinguish different causal frames that determine our perspective. These 
include researchers of last century (Saussure 1916; Vygotsky 1962) through to 
many of today (Tomasello 2003; MacWhinney 2005; Raczaszek-Leonardi 2010; 
Cole 2007; Donald 2007; Larsen-Freeman & Cameron 2008; Uryu et al. 2014; 
Lemke 2000, 2002). We now consider some of the distinctions they have offered. 

The classical two-way distinction made by Saussure (1916) — synchronic versus 
diachronic - is the tip of the iceberg. In a synchronic frame, we view language 
as a static system of relations. In a diachronic frame, we look at the historical 
processes of change that give rise to the synchronic relations observed. But if 
you look at the dynamic nature of language you will quickly see that diachrony 
- in the usual sense of the development and divergence of languages through 
social history - is not the only dynamic frame. 

Vygotsky distinguished between phylogenetic, ontogenetic, and historical pro- 
cesses, and stressed that these dynamic frames were distinct from each other 
yet interconnected. His insight has been echoed and developed, from psycholo- 
gists of communication like Tomasello (1999) and Cole (2007) to computational 
linguists like Steels (1998, 2003) and Smith et al. (2003). 

Smith et al. (2003: 540) argue that to understand language we have to see it 
as emerging out of the interaction of multiple complex adaptive systems. They 
name three “time scales” that need to be taken into account - phylogenetic, on- 
togenetic, and glossogenetic (= “cultural evolution”, i.e., diachronic) - thus echo- 
ing Vygotsky. Language is, they write, “a consequence of the interaction be- 
tween biological evolution, learning and cultural evolution” (Smith et al. 2003: 
541). Raczaszek-Leonardi focuses on psycholinguistic research, and proposes that 
three frames need to be addressed: online, ontogenetic and diachronic. She leaves 
out the phylogenetic frame, but adds the “online” frame of cognitive process- 
ing. Cole (1996: 185) expands the list of dynamic frames to include microgenesis, 
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ontogeny (distinguishing early learning from overall lifespan), cultural history, 
phylogeny, and even geological time. MacWhinney (2005: 193-195) offers a list of 
“seven markedly different time frames for emergent processes and structure”, cit- 
ing Tinbergen’s mentor Konrad Lorenz (1958). MacWhinney’s frames are phylo- 
genetic, epigenetic, developmental, processing, social, interactional, and diachronic. 

Newell (1990: 122) proposes a somewhat more mechanical division of time into 
distinct “bands of cognition” (each consisting of three “scales”). Newell takes the 
abstract/objective temporal unit of the second as a key unit, and defines each 
timescale on a gradient from 1074 seconds at the fast end to 107 seconds at the 
slow end: the biological band (= 1074-107? seconds), the cognitive band (= 1071- 
10! seconds), the rational band (= 10?-10* seconds), and the social band (= 10°-107 
seconds). He also adds two “speculative higher bands”: the historical band (= 10°- 
10'° seconds), and the evolutionary band (= 1011-10! seconds; Newell 1990: 152), 
thus suggesting a total of 18 distinct timescales. 

Like Newell (though without reference to him), Lemke (2000: 277) takes the 
second as his unit and proposes no less than 24 “representative timescales”, be- 
ginning with 1075 seconds - at which a typical process would be “chemical syn- 
thesis” - through to 1018 seconds - the scale of “cosmological processes”. 

Lemke’s discussion is full of insights. But he generates his taxonomy by arbi- 
trarily carving up an abstract gradient. It is not established in terms of research- 
relevant qualitative distinctions or methodological utility, nor is it derived from 
a theory (cf. Uryu et al 2014, 2008: 169). It is not clear, for example, why a distinc- 
tion between units of 3.2 years versus 32 years should necessarily correlate witha 
distinction between processes like institutional planning versus identity change; 
nor why the process of evolutionary change should span three timescales (3.2 
million years, 32 million years and 317 million years) or why it should not apply 
at other timescales. 

Larsen-Freeman & Cameron (2008: 169) propose a set of “timescales relevant to 
face-to-face conversation between two people”: a mental processing timescale of 
milliseconds, a microgenetic timescale of online talk, a discourse event timescale, 
a series of connected discourse events, an ontogenetic scale of an individual’s life, 
and a phylogenetic timescale. Uryu et al. (2014) critique this model for not explain- 
ing why these timescales are the salient or relevant ones, and for not specifying 
which other timescales are “real but irrelevant”. 

Uryu et al. (2014) propose a principled “continuum” of timescales running from 
“fast” to “slow” (11 distinctions in the order atomic, metabolic, emotional, autobi- 
ographical, interbodily, microsocial, event, social systems, cultural, evolutionary, 
galactic) that are orthogonal to a set of “temporal ranges” running from “simple” 
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to “complex” (six distinctions in the order physical universe, organic life forms, 
human species, human phenotype, dialogical system, awareness). Uryu et al’s ap- 
proach applies the notion of ecology to the dynamics of language and its usage 
(see also Cowley 2011, Steffensen & Fill 2014). 

What to make of this array of multi-scale schemes? Some are well-motivated 
but incomplete. Saussure gives a single dynamic frame, leading us to wonder, for 
example, whether we should regard speech processing as nano-diachrony. Vy- 
gotsky gives us three dynamic frames, but does not single out or sub-distinguish 
“faster” frames like microgeny and enchrony. Are we to think of these as pico- 
ontogeny? On the other hand, some schemes give us finer differentiation than 
we need, or offer arbitrary motivations for the distinctions made. What we need 
is a middle way. 


2.2 MOPEDS: A basic-level set of causal frames 


Of the frames discussed in the previous section, six capture what is most useful 
about previous proposals. These six frames are relatively well understood. They 
are known to be relevant to research. They are well-grounded in prior work 
on language and culture. And they are known to be related to each other in 
interesting ways.' This is what we need: a basic-level set of conceptually distinct 
but interconnected causal frames for understanding language. 

Each of the six frames — microgenetic, ontogenetic, phylogenetic, enchronic, 
diachronic, synchronic - is distinct from the others in terms of the kinds of causal- 
ity it implies, and thus in its relevance to what we are asking about language and 
its relation to culture and other aspects of human diversity. One way to think 
about these distinct frames is that they are different sources of evidence for ex- 
plaining the things that we want to understand. I now briefly define each of the 
six frames. 


2.2.1 Microgenetic (action processing) 


In a microgenetic frame, we look at how language and culture are psychologi- 
cally processed. For example, in order to produce a simple sentence, a person 
goes through a set of cognitive processes including concept formulation, lemma 


1 One might wonder if one or more of these frames might be reduced in terms of one or more 
others. It is reminiscent of the idea of reducing social processes to physical ones: Were such a 
reduction possible, it is unlikely to be helpful. 
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retrieval, and phonological encoding (Levelt 1989). Or when we hear and un- 
derstand what someone says (Cutler 2012), we have to parse the speech stream, 
recognize distinct words and constructions, and infer others’ communicative in- 
tentions. 

These processes tend to take place at time scales between a few milliseconds 
and a few seconds. Causal mechanisms at this level include working memory 
(Baddeley 1986), rational heuristics (Gigerenzer et al. 2011), minimization of effort 
(Zipf 1949), categorization, motor routines, inference, ascription of mental states 
such as beliefs, desires, and intentions (Searle 1983; Enfield & Levinson 2006), 
and the fine timing of motor control and action execution. 


2.2.2 Ontogenetic (biography) 


In an ontogenetic frame, we look at how a person’s linguistic habits and abilities 
are learned and developed during the course of that person’s lifetime. Many of 
the things that are studied within this frame come under the general headings of 
language acquisition and socialization. This refers to both the learning of a first 
language by infants (see Clark 2009, Brown & Gaskins 2014) and the learning of 
a second language by adults (Klein 1986). 

The kinds of causal processes seen in the ontogenetic frame include strategies 
for learning and motivations for learning. Some of these strategies and motiva- 
tions can be complementary, and some may be employed at distinct phases of life. 
Causal processes involved in this frame include conditioning, statistical learn- 
ing and associated mechanisms like entrenchment and pre-emption (Tomasello 
2003), adaptive docility (Simon 1990), a pedagogical stance (Gergely & Csibra 
2006), and long-term memory (Kandel 2009). 


2.2.3 Phylogenetic (biological evolution) 


In a phylogenetic frame we ask how our species first became able to learn and use 
language. This is part of a broader set of questions about the biological evolution 
and origin of humankind. It is a difficult topic to study, but this has not stopped 
a vibrant bunch of researchers from making progress (Hurford 2007, 2012; Levin- 
son 2014). 

Causal processes in a phylogenetic frame include those typically described in 
evolutionary biology. They invoke concepts like survival, fitness, and reproduc- 
tion of biological organisms (Ridley 1997, 2004), which in the case of language 
means members of our species. The basic elements of Darwinian natural selec- 
tion are essential here: competition among individuals in a population, conse- 
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quential variation in individual characteristics, heritability of those characteris- 
tics, exaptation, non-telic design, and so forth (Darwin 1859; Dawkins 1976; Jacob 
1977; Mayr 1982). 


2.2.4 Enchronic (social interactional) 


In an enchronic frame, we look at language in the context of social interaction. 
When we communicate, we use sequences of moves made up of speech, gesture, 
and other kinds of signs. The causal processes of interest involve structural rela- 
tions of sequence organization (practices of turn-taking and repair which orga- 
nize our interactions; Schegloff 1968, 2007; Sacks et al. 1974; Schegloff et al. 1977; 
Sidnell & Stivers 2012) and ritual or affiliational relations of appropriateness, ef- 
fectiveness, and social accountability (Heritage 1984; Atkinson & Heritage 1984; 
Stivers et al. 2011; Enfield 2013). 

Turn-taking in conversation operates in the enchronic frame, as do speech act 
sequences such as question-answer, request-compliance, assessment-agreement, 
and suchlike (see Enfield & Sidnell 2014). Enchronic processes tend to take place 
at a temporal granularity around one second, ranging from fractions of seconds 
up to a few seconds and minutes (though as stressed here, time units are not the 
definitive measure; exchanges made using email or surface mail may stretch out 
over much greater lengths of time). 

Enchronic processes and structures are the focus in conversation analysis and 
other traditions of research on communicative interaction. Some key causal el- 
ements in this frame include relevance (Garfinkel 1967; Grice 1975; Sperber & 
Wilson 1995), local motives (Schutz 1970; Leont’ev 1981; Heritage 1984), sign- 
interpretant relations (Kockelman 2005, 2013, Enfield 2013: Chapter 4), and social 
accountability (Garfinkel 1967; Heritage 1984). 


2.2.5 Diachronic (social/cultural history) 


In a diachronic frame, we look at elements of language as historically convention- 
alized patterns of knowledge and/or behavior. If the question is why a certain 
linguistic structure is the way it is, a diachronic frame looks for answers in pro- 
cesses that operate in historical communities. While of course language change 
has to be actuated at a micro level (Weinreich et al. 1968; Labov 1986; Eckert 2000), 
for a linguistic item to be found in a language, that item has to have been diffused 
and adopted throughout a community before it can have become a convention. 
Among the causal processes of interest in a diachronic frame are the adop- 
tion and diffusion of innovations, and the demographic ecology that supports 
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cultural transmission (Rogers 2003). Population-level transmission is modulated 
by microgenetic processes of extension, inference, and reanalysis that feed gram- 
maticalization (Hopper & Traugott 1993). 

Of central importance in a diachronic frame are social processes of group fis- 
sion and fusion (Aureli et al. 2008), migration (Manning 2005), and sociopolitical 
relations through history (Smith 1776; Marx & Engels 1947; Runciman 2009). The 
timescales of interest in a diachronic frame are often stated in terms of years, 
decades, and centuries. 


2.2.6 Synchronic (representation of relations) 


Finally, a synchronic frame is different from the other frames mentioned so far 
because time is removed from consideration, or at least theoretically so. One 
might ask if it is a causal frame at all. But if we think of a synchronic system 
as a true description of the items and relations in a person’s head, as coded, for 
example, in their memory, then this frame is real and relevant, with causal impli- 
cations, even if we see it as an abstraction (e.g., as bracketing out near-invisible 
processes that take place in the fastest levels of Newell’s “biological band”; see 
section 2.1, above). 

In Saussure’s famous comparison, language is like a game of chess. If we look 
at the state of the game half way, a diachronic frame would view the layout in 
terms of the moves that had been made up to that point, and that had created 
what we now see. A synchronic account would do no more than describe the 
positions and interrelations of the pieces on the board at that point in time. For 
an adequate synchronic description, one does not need to know how the set of 
relations came to be the way it is. 

There are two ways to take this. One is to see the synchronic frame as a purely 
methodological move, an abstraction that allows the professional linguist to de- 
scribe a language as a whole system that hangs together. Another - not in conflict 
with the first — is to see the synchronic description of a language as a hypothesis 
about what is represented in the mind of somebody who knows the language. 

A synchronic system cannot be an entirely atemporal concept. At the very 
least this is because synchronic structures cannot be inferred without procedures 
that require time; e.g., the enchronic sequences that we use in linguistic elicita- 
tion with native speaker informants. But a synchronic system is clearly distinct 
from an associated set of ontogenetic processes, on the one hand, and diachronic 
processes, on the other (though it is causally implied in both). We can infer 
an adult’s knowledge of language and distinguish this from processes including 
the learning that led to this knowledge and the history that created the conven- 
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tional model for this knowledge (but which neither the learner nor the competent 
speaker need have had access to). 

The goal here is to define frames that are relevant to a natural, causal account 
of language. So when I talk about a synchronic frame I mean a way of think- 
ing about the conceptual representations of a language that make it possible for 
people to produce and interpret utterances in that language. 

Causality in a synchronic frame is tied to events that led to the knowledge, 
and to events that may lead from it, as well as how the nature and value of one 
convention may be dependent on the nature and value of other conventions that 
co-exist as elements of the same system. 


2.3 Interrelatedness of the frames 


How are these frames interrelated? As Raczaszek-Leonardi (2010: 276) says, 
“even if a researcher aims to focus on a particular scale and system, he or she 
has to be aware of the fact that it is embedded in others”. Other authors (Cole 
1996: 179, MacWhinney 2005: 192) have asked: What are the forces that cause 
these frames to “interanimate” or “mesh”? The way to find out would be to test 
and extend the useful suggestions of authors like Newell (1990), Cole (1996: 184- 
185), MacWhinney (2005), Lemke (2000: 279-286) and Uryu et al. (2014). 

How might the outputs of processes foregrounded within any one of these 
explanatory frames serve as inputs for processes foregrounded within any of the 
others? Answers to this question will greatly enrich our tools for explanation. 


2.4 The case of Zipf’s length-frequency rule 


Why is it good to have a set of distinct causal frames for language? Because it 
offers explanatory power. Consider the observation made by Zipf that “every 
language shows an inverse relationship between the lengths and frequencies of 
usage of its words” (Zipf 1949: 66).” Zipf suggested that the correlation between 
word length and frequency is explained by a psychological preference for mini- 
mizing effort. If we take this as a claim that synchronic structures in language 
are caused by something psychological — though Zipf’s own claims were rather 
more nuanced - this raises a linkage problem (Clark & Malt 1984: 201). 


? I am grateful to Martin Haspelmath for insisting on the distinction between Zipf’s Law and 
Zipf’s length-frequency rule (cf. Newman 2005). Zipf’s Law states that there is a correlation 
between the frequency of an item and its frequency rank relative to other items in a set. His 
length-frequency rule states that the shorter a word is, the more frequently the word is used. 
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The problem is that a person’s desire to minimize effort cannot directly affect a 
synchronic system’s structure. A cognitive preference is a property of an individ- 
ual, while a synchronic fact is shared throughout a population. Something must 
link the two. While it may be true that the relative length of the words I know 
correlates with the relative frequency of those words, this fact was already true 
of my language before I was born. The correlation cannot have been caused by 
my cognitive preferences. How, then, can the idea be explicated in causal terms? 

As was clear to Zipf (1949), to solve this problem we appeal to multiple causal 
frames. We can begin by bringing diachronic processes into our reasoning. A 
presumption behind an account like Zipf’s is that all members of a population 
have effectively the same biases. The key to understanding the status of a mi- 
crogenetic bias like “minimize effort in processing where possible” is to realize 
that this cognitive tendency has an effect only in its role as a transmission bias 
in a diachronic process of diffusion of convention in a historical population (see 
below chapters for explication of diachrony as an epidemiological process of bi- 
ased transmission, following Rogers 2003, Sperber 1985, and Boyd and Richerson 
1985; 2005). The synchronic facts are an aggregate outcome of individual people’s 
biases multiplied in a community and through time. The bias has a causal effect 
precisely in so far as it affects the likelihood that a pattern will spread throughout 
that community. 

Now, while the spread of a pattern and its maintenance as a convention in a 
group are diachronic processes, a transmission bias can operate in three other 
frames. In an ontogenetic frame, a correlation between the shortness of words 
and the frequency of words might make the system easier to learn. This bias 
causes the correlation to become more widely distributed in the population. In 
a microgenetic frame, people may want to save energy by shortening a word 
that they say often, again broadening the distribution of the correlation. And an 
enchronic frame will capture the fact that communicative behavior is not only 
regimented by individual-centered biases in learning, processing, and action, but 
also by the need to be successfully understood by another person if one’s commu- 
nicative action is going to have its desired effect. The presence of another person, 
who displays their understanding, or failure thereof, in a next move - criterial to 
the enchronic frame - provides a selectional counter-pressure against people’s 
tendency to minimize effort in communicative behavior. One’s action has to be 
recognized by another person if that action is going to succeed (Zipf 1949: 21, 
Enfield 2013: Chapter 9). 

If we adopt a rich notion of a diachronic frame in which transmission biases 
play a central causal role, we can incorporate the ontogenetic, microgenetic and 
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enchronic frames in explaining synchronic facts. We do this by invoking the 
mechanisms of guided variation explicated by Boyd & Richerson (1985, 2005) 
and explored in subsequent work by others (Kirby 1999; Kirby et al. 2004; Chris- 
tiansen & Chater 2008; Chater & Christiansen 2010). This allows us to hold onto 
Zipf’s insight, along with similar claims by authors such as Sapir before him, and 
Greenberg after him, who both also saw connections between individual-level 
psychological biases and community-level synchronic facts. Greenberg (1966) 
implied, for example, that there is a kind of cognitive harmony in having anal- 
ogous structures in different parts of a language system. Sapir (1921: 154-158) 
suggested that change in linguistic systems by drift can cause imbalances and 
“psychological shakiness”, which motivates the reorganization of grammar to 
avoid that mental discomfort. 

Similar ideas can be found in work on grammaticalization (Givon 1984; Bybee 
2010) and language change due to social contact (Weinreich 1953), leading to the 
same conclusion: Synchronic patterns can have psychological explanations but 
only when mediated by the aggregating force of diachronic processes. 

The point is central to explaining other observed correlations in language and 
its usage, for example that more frequent words change more slowly (Pagel et al. 
2007), that differences in processes of attention and reasoning correlate with dif- 
ferences in the grammar of the language one speaks (Whorf 1956; Lucy 1992; 
Slobin 1996), that ways of responding in conversation can be constrained by 
collateral effects of language-specific grammatical structures (Sidnell & Enfield 
2012), that tendencies in natural meaning can correlate with universals in the 
sounds of words (Dingemanse et al. 2013), and that cultural values can shape 
grammatical categories (Hale 1986; Wierzbicka 1992; Chafe 2000; Enfield 2002; 
Everett 2005, 2012). But most if not all of these claims bracket out some elements 
of the full causal chain involved. To give a complete and explicit account, multi- 
ple frames are needed. 
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Anyone who wants a natural, causal account of linguistic and other cultural 
transmission will have to study transmission biases. These are the biases that 
ultimately regulate the historical, cumulative transmission of culture. To under- 
stand how the linguistic habits of communities change over generations - in a 
diachronic frame — we must also look in the ontogenetic frame, that is, in the 
process of language acquisition, and the resultant slight differences in habits of 
speech between generations. Language acquisition involves the effective trans- 
mission of a language from parents to children. Imperfections in this transmis- 
sion are sometimes thought to explain language change. Consistent patterns in 
the details of such changes have been documented across a wide range of the 
world’s languages. Many argue that natural paths of semantic change are moti- 
vated by species-wide innate conceptual structure. There are universals in seman- 
tic change, independent from social factors and other factors outside the minds 
and bodies of speakers. But this is only part of the story. Even when new ideas 
for ways of saying things have their source within a single person, the spread of 
that idea follows mechanisms of population-level social transmission. And the 
success or failure of such transmission is ultimately dependent on the biases that 
are the topic of this chapter. 

Cultural transmission can be usefully understood in relation to epidemiology 
(Dawkins 1976; Sperber 1985). We catch ideas from others, in this case ideas for 
attributing meanings to signs. 


An innovation in a language begins its existence in the mouths and minds 
of one or more speakers and spreads from them to other speakers. In fact, 
innovations occur constantly in the speech of individuals, but an inno- 
vation becomes part of the history of the language only when it spreads 
through the network to become a stable feature in the speech of a group 
of speakers. (Ross 1997: 214-215) 


On syntax specifically, Harris and Campbell make a similar point: 


Isolated creative, exploratory expressions are made constantly by speakers 
of all ages. Such expressions may be developed for emphasis, for stylistic 
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or pragmatic reasons (to facilitate communication as in changes to avoid 
ambiguity or to foster easier identification of discourse roles), or they may 
result from production errors. The vast majority of such expressions are 
never repeated, but a few “catch on”. (Harris & Campbell 1995: 54) 


How do they catch on? How do they make this leap from single speaker to 
population-wide? How does an innovation become a stable feature in the speech 
of a group of speakers? In this chapter I discuss a crucial part of the answer 
to this question: the biases that operate in linguistic and cultural change, in the 
diachronic frame. I will define some important biases, and I will say why we need 
a coherent conceptual framework to explain just why we observe the biases we 
observe. 


3.1 Cultural epidemiology 


In the cultural evolution of language, that is, the diffusion, maintenance, and 
change of linguistic practices in historical communities, it is often assumed or 
implied that the unit of analysis is the language system as a whole. But the di- 
achronic replication and transmission of whole language systems is not causally 
conducted directly at the system level (see Chapter 1 above). It is an aggregate 
outcome of a massive set of much simpler and much smaller concrete speech 
events that operate, in enchronic and microgenetic frames, on the parts of a lan- 
guage, such as words or pieces of grammar (Hudson 1996). 

Language systems only exist because populations of linguistic items replicate 
and circulate in human communities, whenever people say things. A causal ac- 
count of language evolution that focuses on the transmission of linguistic items 
can be called an epidemiological view, following Sperber (1985, 1996), and in a 
similar spirit to Keller (1994) and Croft (2000). In an item-based account, the 
pieces of a system can change independently from other pieces, and they can 
be plucked out and borrowed from one system to another. This happens for ex- 
ample when we borrow a word. In diachronic processes, both enchronic and 
microgenetic processes play a role. 

Ultimately we need a causal account for why it sometimes seems like we can 
treat languages as if they were organism-like systems (e.g., when we write gram- 
mars). This is the topic of Chapter 4, below. But first we need to define the 
basic underlying causal anatomy of item-based language transmission. Here I 
outline the basics of a transmission biases approach to the historical evolution 
of languages. 
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3.2 Biased transmission 


The diffusion of cultural items in the diachronic frame is explained in terms of a 
biased transmission model of the distribution of cultural knowledge and practice 
within human populations and across generations, following a general frame- 
work of cultural epidemiology (Sperber 1985, 1996; Boyd & Richerson 1985, 2005; 
Enfield 2003, 2008). In a biased transmission model, the question of whether fash- 
ions of cultural practice in a population spread, decline, transform, or remain as 
they are will be determined by the cumulative effect of biases: filters, pumps, 
and transformers on cultural practices in a competition for social uptake. The 
processes are visible in the diachronic frame, but their proximal causal bases are 
seen in enchronic and microgenetic frames. 

Linguistic and other cultural items are not confined to the mind. Nor are they 
confined to things or actions that can be perceived. They are simultaneously 
manifest in mental and material domains, and in relations between these do- 
mains. At any moment, a community is buzzing with enchronic and microge- 
netic causal chains that constitute continuous lines of production and compre- 
hension of pieces of language and culture. I am referring to people’s courses 
of goal-directed action using words, tools, body movements, and other cultural 
items. 

These courses ofbehavior are contexts in which the natural histories of cultural 
and linguistic items are played out. They constitute causal chains with links from 
mind (I know a word, I understand a tool) to usage (I use the word in conversation, 
I use the tool for a purpose), to mind (the other person learns or recognizes the 
word, an onlooker learns or recognizes the tool’s function, attributing a goal to 
my behavior), to usage, to mind, to usage, to mind, to usage, and so on. This 
type of causal trajectory is a chain of iterated practice, or a cognitive causal chain 
(Sperber 2006). See Figure 3.1 for a simplified illustration. 


Ch ur GE ur GE A) 


Figure 3.1: Simplified illustration of iterated practice, or a social cognitive causal 
chain (Sperber 2006:438). 


Figure 3.1 is not an iterated learning chain, of the kind presented by Kirby and 


colleagues (Kirby et al. 2004, 2008), among others (Christiansen & Chater 2008; 
see below). Those iterated learning depictions resemble Figure 3.1, but they are 
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not the same. In iterated learning (studied to date using small, artificial languages 
in lab settings), each arrow from public to private may represent an entire learn- 
ing process in an ontogenetic frame, such as a child’s learning of a language. 
Each link in the chain is effectively a single macro-level state change in ontogeny 
(e.g., the move from not knowing the language to knowing the language). This 
is shorthand for a huge set of small events and small associated state changes. 

Learning a language involves not one event but many iterations of exposure 
and reproduction. In each micro-occasion of exposure and reproduction there 
is feedback that comes from others’ reactions to how we use words in context. 
This feedback plays an essential role in learning. Both the microgenetic and on- 
togenetic frames are relevant. The iterated learning model abstracts away from 
these details (not without practical reason), while the iterated practice model in 
Figure 3.1 tries to capture them directly and explicitly. 

While iterated learning focuses on the ontogenetic or biographical frame, it- 
erated practice focuses on the enchronic frame, that is, the frame of moves and 
counter-moves in human interaction (see Enfield 2009: 10, 2013: Chapter 4). In 
Figure 3.1, each link in the chain from private-public-private does not represent 
a generation of individuals in a human population (by contrast with the com- 
parable figure in Christiansen & Chater 2008). It represents a generation of in- 
dividuals in a population of items, that is, one local cycle of instantiation of a 
practice, such as a single use of a word, a single performance of a ritual, or a 
single occasion of making bacon and eggs for breakfast. 

The schema in Figure 3.1 draws our attention to a set of bridges that a bit of 
culture has to cross if it is to survive a cycle of iterated practice. What are the 
forces that help things across those bridges, and what are the forces that inhibit 
them? These forces are called transmission biases (following Boyd & Richerson 
1985, 2005). This kind of account assumes a standard model of Darwinian evolu- 
tion - variation of heritable traits in a population — where the variation is guided 
in a specific way. 

As Boyd & Richerson (1985) formulate it, variation of cultural items is guided 
by the properties of people. For example, if a certain way of doing something 
is easier to learn than some other functionally equivalent way (e.g., doing math- 
ematics on a calculator versus on an abacus), then this is likely to increase the 
frequency of the easier variant in the population. All things being equal, this 
variant will also in turn become more frequent simply because it is already more 
frequent. 

Christiansen & Chater (2008) use this idea in arguing that the properties of the 
human brain, e.g., for learning and processing language, favour certain linguistic 
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variants over others. Language is the way it is because it is “shaped by the brain”, 
and thus not because the evolution of a language faculty has caused the human 
brain to change in some fundamental way as a result of the way language is. 
Assuming this model of guided variation, the question then becomes: What 
are the forces that guide variation in this way, and that operate upon variants 
within a population, ultimately determining whether those variants become, or 
remain, conventional in the population? We now consider some known biases. 


3.3 Some known biases 


Variants of cultural behavior compete for adoption by people in populations. Dif- 
ferent researchers have described different biases, sometimes in quite specific 
terms, sometimes in general terms. 

Christiansen and Chater (2008; see also Chater & Christiansen 2010) describe 
four factors that mostly have to do with properties of the individual human body, 
especially the brain. These are (1) perceptuo-motor factors, (2) cognitive limita- 
tions on learning and processing, (3) constraints from mental representations, (4) 
pragmatic constraints. These factors can affect the likelihood that one linguistic 
variant is selected over another. (Ihe social mechanisms that are also a necessary 
part of the process are left implicit by these authors.) 

Boyd & Richerson (1985) introduce distinctions that are broader in kind. They 
illustrate with an example from table tennis. For the function of hitting the ball, 
you can choose between holding the bat with a pencil grip or a handle grip. 
Choosing one of these variants necessarily rules out choosing the other. They 
discuss biases that might cause a person to select one or the other grip. 

A direct bias has to do with the relationship between a variant and a person 
who adopts that variant. It concerns affordances (Gibson 1979). A person should 
choose variant A if it is somehow more advantageous than variant B for a prox- 
imate function in some context. By a direct bias we should choose the grip that 
is easier, more effective, feels better, gives better results. 

An indirect bias has to do with social identity. When a person adopts a variant, 
other people will see. This will lend a certain status to both the adopter (as the 
kind of person who adopts that variant) and the variant (as a variant that is 
adopted by that person or someone like that). People adopt variants of behaviors 
not only for their efficacy but also with some idea of how they will be seen by 
others when they make that choice. So by an indirect bias we should choose the 
same grip as people who we identify with, or want to emulate. 

Finally, a frequency-dependent bias favours variants that are more frequent. 
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Similar biases have been described in a large literature in sociology on the 
diffusion of innovations (Rogers 2003). Here, we can discern three sets of condi- 
tioning or causal factors in the success or failure of a practice. 


1. Sociometric factors have to do with the network structure of demographic 
groups. People are socially connected in different ways, especially in terms 
of the number of their points of connection to others in a social network, as 
well as the quality of these connections. A practice is more likely to spread 
if it is modeled by someone who is widely connected in a network. This is 
because he or she will expose a greater number of people to the practice. 
Gladwell (2000) refers to this as the law of the few: a small number of 
people in group have the biggest influence on the diffusion of innovation. 


2. Personality factors have to do with differences between people in the pop- 
ulation that can affect the success or failure of an innovation. Some people 
are more willing than others to innovate and to adopt others’ innovations 
(early adopters versus laggards). These differences may correlate with so- 
cial categories such as age, class, and sub-culture. Some people are better 
known or better admired in their social milieu and may thus be more likely 
to be imitated. 


3. The utility of an innovation is more or less what Boyd & Richerson (1985) 
refer to as direct bias, outlined above. The innovation will take off if it is 
more advantageous to potential adopters. 


Each of the biases we have just reviewed plays an important role in the mech- 
anisms of transmission that drive the circulation of bits of culture in human pop- 
ulations. But how to explain them? Where do these biases come from and how 
are they related to each other? Can we motivate these biases by locating them 
directly in the causal anatomy of transmission? 


3.4 A scheme for grounding the biases 


One way to justify and limit the number of transmission biases is to motivate 
them in terms of the structure of iterated practice shown in Figure 3.1. This struc- 
ture gives us a way of locating and characterizing the biases. If we look at the 
elements of transmission illustrated in Figure 3.1, we see at the heart of it a re- 
peating, four-stroke cycle consisting of the following steps (see Figure 3.2): 
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exposure reproduction 


representation material 


Figure 3.2: Loci for transmission biases; a four-stroke engine model. 


1. Exposure: a process of going from public (out in the world) to private (in 
someone’s mind), when a person comes into contact with, and perceives 
or engages with, a bit of culture; 


2. Representation: how an idea is created and stored in the mind, based on (1), 
and the private product of this process; 


3. Reproduction: a process of going from private (in someone’s mind) to public 
(out in the world), made possible in part by a person’s motivation to cause 
the same public event as in (1). 


4. Material: the physical result of an event of reproduction of a cultural item. 


5. Stages (3-4) can then lead to another round by exposing another person 
to the cultural item in question (feeding into a new stage (1)). 


Each of the four steps is a possible threshold for any bit of culture to succeed 
or fail in the competition for uptake in a community. If people aren’t exposed to 
it, it will die. If it is difficult to remember or think of, or if in the course of mental 
representation it is radically altered, it will die, or effectively die. If people aren’t 
motivated to reproduce it, no further exposure will happen, and when the people 
who have learned the practice in question die, the practice will die with them. 
This happens for example with language extinction. And if the practice is not 
physically realized, so that others may perceive it, the transmission process will 
stall. 

Failure on any of these four loci of transmission causes a break in the chain 
and may cause the variant to no longer exist. 

Do not get the impression that a single such chain represents the entire histori- 
cal trajectory of a cultural item. It is only the tiniest strand. At any moment, there 
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is a thicket of equivalent chains of iterated practice that keep a bit of language 
or culture alive and evolving in a community. 

Again, the question that a biased transmission approach to linguistic epidemi- 
ology asks is: What are the filters, pumps, and transformers that act upon the his- 
tory of a cultural item? On the present proposal, we can posit four functionally- 
defined loci at which any bias can have an effect. Each locus is defined by the 
function it serves in braking, accelerating, or altering the transmission of prac- 
tices in communities through social-cultural interaction, in an enchronic frame. 

While there may be a long, if not open list of possible biases, they all should be 
definable in terms of how they operate upon one or more ofthe four transmission 
loci, exhaustively defined by the causal structure represented in Figures 3.1 and 
3.2 above: exposure (world-to-mind transition), representation (mind structure), 
reproduction (mind-to-world transition), and material (world structure). Within 
the framework of these basic causal loci for transmission (1-4), different biases 
may affect the transmission of a practice in different ways. 

As sketched above, some of these biases will have to do with facts about so- 
cial networks, some with individual personality traits, some with properties of 
human perception, attention, memory, and action, some with the shape of the 
human body, some with the culture-specific means and ends that come with cul- 
turally evolved structures of activity, some with the organization of complex in- 
formation in cognition. Let us now briefly consider how the previously described 
biases fit within the framework of these minimal loci for cultural transmission. 
Before we start, here is an important point. The goal of this exercise is not to 
locate each bias at just one point in the chain. As we shall see, some biases have 
effects at more than one point. This is one of the things the exercise shows us. 


3.4.1 Exposure 


Exposure - relating to the world-to-mind transition — is where biases can affect 
the likelihood that a person will come into contact with, and pay attention to, a 
practice. 

One type of bias that effects exposure is social connectedness. All people are 
situated in social networks, but they are situated in different ways. One type of 
difference between people has to do with the number of other people we come 
into contact with. Connectors have a large number of social ties (Granovetter 
1973). They are more likely to be exposed to an innovation, or to expose others 
to it. People with fewer social network connections will have a lower chance of 
being exposed to a given practice, or exposing others. 
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Another type of bias relevant to exposure is salience. When you come across 
a new kind of behavior you may or may not pay attention to it. The things that 
stand out will more likely to attract your attention. The definition of “stand out” 
is clearly a matter of perception in the classical sense of affordances, that is, a 
matter of the relationship between a person and a thing being perceived. Some 
things are more likely to be noticed because of the nature of our senses in relation 
to the world. Other things are more salient to us because we are actively looking 
for them, often because our language or culture encourages or requires it. 

A third bias relevant to exposure is identity. Who is the person carrying out 
the practice when it is encountered? If it is somebody who I want to be like in 
some way, then I am more likely to pay attention to what the person is doing and 
how. If it is someone I have no interest in, I will be less likely to pay attention. 
In this way, social identity can play a role in biasing exposure, by affecting the 
extent to which someone will attend, or carefully attend, to the practice when 
encountered. 


3.4.2 Representation 


Representation — relating to mind structure — is where biases can affect the likeli- 
hood that, or the manner in which, a practice will be learnt or stored by a person, 
or how the psychological or otherwise private component of a practice will be 
structured. 

Once we are exposed to a certain pattern of behavior, we can learn it. We 
form a representation of it, attributing to it some meaning or function, and we 
incorporate that representation into an existing framework of knowledge. 

Some innovations are more memorable than others. Some things are more 
easily internalized. This is explained by cognitive preferences that are either 
known from psychological science or that are on that research agenda. 

There are other differences in how things are learnt. Whether you see a thing, 
hear it, feel it, or some combination of these, can have consequences for how that 
thing is interpreted, learnt and understood (Enfield 2009: Chapter 6). This can 
then affect how the new knowledge is applied. For example, it may shape how 
you decide that a practice is an appropriate means for certain ends in a particular 
context. 

There are effects of the psychological context into which a practice is embed- 
ded. Practices are partly constituted by knowledge; knowledge that is caused by, 
and in turn causes, public behavior and associated states of affairs. Knowledge 
has structure, including part-whole relations, hierarchical relations, and other 
sorts of dependency among items in a system. 
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When we learn something, we relate it to other things we know. We do this 
at the very least because the thing stood in a certain relation to other things in 
the context in which we learnt it. As an example, if I learn a new word such as 
deplane, I relate it to other words I already know. There might be similarities with 
other words: debone, derail, decode, decommission. Or associations with other 
features of the language system: deplane is a verb and can be used only with 
specific grammatical roles in English sentences. Or if I learn about the possibility 
of downloadable ringtones I will naturally link this to my existing knowledge 
of mobile phones and the Internet. All of these are examples of a context bias. 
Through a context bias a person is more readily able to learn and psychologically 
represent those things that have an existing “place” in which to fit. 

In language, items are structured into paradigms, syntagms, conceptual frames, 
semantic fields, and other kinds of linguistic systems. While these systems often 
display a degree of symmetry, consistency, and simplicity, change is always tak- 
ing place. In a system, when something happens in one place this will have 
effects in another place. In lexicon and grammar, such system-internal dynam- 
ics can give rise to a certain “psychological shakiness”, as Sapir (1921) put it. As 
noted already in Chapter 2 above, this can lead to reorganization of a system, in 
people’s heads, and then potentially in a whole community. 

Now finally, note that content biases are also relevant to the representation 
locus of transmission. In the broadest sense of meaning, capturing everything 
from the arbitrary meanings of words in languages to the affordance-grounded 
functions of tools (Kockelman 2006), we benefit from what can be called natural 
meaning. If a word or grammatical expression is compatible with other informa- 
tion, for example by having iconic properties, it is better learnt and remembered. 
Similarly for technology, if there is a good match between the intended function 
of a tool and the tool’s natural affordances, then we are more likely to under- 
stand the practice of using that tool, it will be easier to learn, and indeed what 
needs to be stored in the mind is reduced because the relevant information can 
stored materially (Norman 1991). These examples of the content bias pertain to 
learning, storage, and reduction of load on cognition. 


3.4.3 Reproduction 


Reproduction - relating to the mind-to-world transition — is where biases can 
affect the likelihood that a person who is exposed to a kind of behavior will later 
do it themselves. One way to think of this sense of reproduction is whatever 
causes a person to turn the private representation of a practice into an action 
whose production and effects are then perceptible by others. 
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What motivates us to turn knowledge into action? Daily life involves goal- 
directed behavior that is motivated by our beliefs and desires (Davidson 2006; 
Searle 1983; Fodor 1987). I may want to get something done for which I need 
another person’s cooperation. One way to do this is with language. I select 
certain words and grammatical constructions as tools for the job. Depending 
on my goals, I will choose certain words and will thereby choose against all the 
other words I could have used. 

This is the competition among words and grammatical forms invoked in Dar- 
win’s (1871: 60) citation of Max Müller (1870): “A struggle for life is constantly 
going on amongst the words and grammatical forms in each language”. The com- 
petition among different cultural practices operates in the same way. Suppose I 
have a goal. I will have beliefs about how it can be attained. I will have knowledge 
that allows me to act. I can foresee at least some effects of my actions. All this 
points to a powerful bias at the reproduction locus of transmission, concerning 
a person’s functional needs, and the available means to those ends. 

The content bias, again, fits partly under this rubric. As discussed above, a 
content bias favours a practice that is more beneficial in some way to the per- 
son who selects it. Recall that a direct content bias applies when the benefit is 
greater functional payoff, or reduced cost, of the practice, in terms of its primary 
functional effects. In the table tennis example (see section 3.3, above), a direct 
content bias would favour the pencil grip if the pencil grip were lower in cost 
or greater in benefit than the handle grip - that is, in terms of its efficacy for 
getting the ball back over the net and, ultimately, for winning matches. An indi- 
rect content bias is also relevant to the reproduction locus of transmission: the 
choice to use the variant at all will have to do with the effects of whom you might 
show yourself to identify with (or against). There is an extensive literature on 
this in sociolinguistics. Speaking English, I might say guy in one context and 
bloke in another. Maybe there is a slight meaning difference between these two 
words, thus invoking a direct content bias. But these differences may be minimal 
compared to the effect of identifying myself with certain sub-cultural groups or 
kinds of social relationship by virtue of this choice between different word forms 
with near-identical meanings. 

Clearer examples concern pronunciation. Whether I choose to say working or 
workin’ has more to do with who I want to identify with (an indirect bias) rather 
than the meaning I want to convey (a direct bias). In the cultural realm, both a 
Rolex and a Tagheuer will tell the time for a high price but the choice to wear one 
or the other may depend on whether you want to identify with Roger Federer 
versus Tiger Woods (or tennis versus golf). 
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And there is perhaps most often some combination of the two. Do I choose to 
drink this brand of beer over all the rest because it tastes better (a direct bias) or 
because by doing so I identify with some person or group of people (an indirect 
bias)? It could be both. In any case, the mechanisms at play will bias a person’s 
motivation for selecting one practice over all the others that he thereby does not 
select. 

The indirect bias is also sometimes called a model bias. An important distinc- 
tion can be made here depending on the age of the person concerned. How does 
a child select which variants of a practice to adopt? A conformity bias favours 
those practices that “everyone else” adopts (Boyd & Richerson 1985; Gergely & 
Csibra 2006). Another term for this bias is docility (Simon 1990). This refers to 
an adaptive propensity to do what other members of your group do, and in the 
same ways, without wondering why. An infant’s model group will tend also to 
consist of the people who she is genetically most closely related to. The effect is 
that cultural practices and genes tend to (but need not) have parallel histories. 

As people grow up and come to be regarded full members of their group, they 
come across a greater number and range of cultural items. They keep learning. 
So at any time they may find themselves with new choices. This may be because 
they encounter other ways of doing things than the way “my people” do things. 
This happens when they come into contact with other groups, for instance in 
trading, ritual and other kinds of inter-group social interaction. Different people 
in a community will have different degrees of mobility, sometimes as a result of 
personality, sometimes as a result of gender (men often travel more widely than 
women), age or sub-culture. 

At a later age, there is a greater degree of choice and therefore greater compe- 
tition between choices. We may or may not consciously deliberate about such 
choices. But as adults we may be more aware of the meanings of different options. 
Here is where the indirect bias looks more like the model bias exploited in com- 
mercial advertising. This bias applies in all diffusional processes by favouring 
practices that are modeled by, for example, more admired or charismatic people. 


3.4.4 Material 


Material — relating to world structure — is where biases can affect the way in 
which a practice will be physically perceived. 

Biases on the material locus of transmission have to do with the physical affor- 
dances of cultural practices, and the ways in which these affordances affect the 
exposure and reproduction of those practices. Material-related biases can affect 
exposure-related biases in some obvious ways. The material nature of speech is 
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such that it fades almost instantly (gesture slightly less so, etc; see Enfield 2009). 
But when language is reproduced in writing, this evanescence is dramatically 
lessened, and the dynamics of transmission are significantly affected. 

Outside of language, we see similar contrasts. Many activities, like adopting a 
certain grip for table tennis, can only be seen momentarily. They are only avail- 
able for exposure simultaneously with the reproduction process that potentially 
constitutes the transmission event (photos, etc., aside). The table tennis bat itself, 
however, has a more persistent physical existence, and can stand as a public sign 
for the possible ways people might handle it (Norman 1988; Kockelman 2006). 

Material-related biases have to do with the ways in which cultural practices are 
made public, and how their form of public existence might affect their availability 
in the exposure-reproduction cycle we have been exploring here. 


3.4.5 Networks 


If the above-mentioned elements are an engine for the transmission of innova- 
tion, then social networks are the paths that innovations take. The career of an 
idea may theoretically be mapped in a large but finite network (Luce 1950, Miller 
1951: Chapter 12, Milroy 1980; Ross 1997). 

In fashion and other kinds of social epidemic, the success of an innovation will 
partly depend on the ways in which people’s personalities differ. As Gladwell 
(2000) accessibly lays out, different personality types contribute to the diffusion 
of innovation in complementary ways. Connectors have a high number of weak 
social connections, in a range of social spheres. Mavens are actively interested in 
the market, and want to share their knowledge and opinions. Salesmen are the 
charismatic, persuasive ones who model innovations and effectively sell them. 
Innovators are the risk-takers who try things before anyone else does. They are 
followed by early adopters, the early majority, the more conservative late major- 
ity, and finally, the laggards. 

When all of these types of people come into contact, they form social networks. 
The approach to language in terms of networks was pioneered in sociolinguis- 
tics by Milroy (1980), and also taken up by Le Page & Tabouret-Keller (1985), Ross 
(1997), and others. Milroy (1980) developed a method for studying linguistic vari- 
ation based around the idea of social networks, “the informal social relationships 
contracted by an individual” (Milroy 1980: 174), which “can be used to account 
for variability in individual linguistic behavior in communities” (Milroy 1980: 21). 
The social network model “treats speakers as nodes in a social network, such that 
each speaker is connected with other speakers by social (and therefore commu- 
nication) links” (Ross 1997: 213). The idea is to map the network of contacts that 
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each individual has. Milroy suggested that networks could be placed on a scale of 
density, from low to high. In a low density network, a may be in regular contact 
with b, c, and d, but b, c, and d are never in contact with each other. In a high 
density network, a, b, c, and d are all in contact with each other. 

Usually, contacts between two people are made in the presence of other net- 
work members. So, to the high density network, we could add the ties a-b-c, 
a-b-d, a-c-d, b-c-d, and a-b-c-d. 

The network concept contributes “to analysis of the manner in which individ- 
uals utilize the resources of linguistic variability available to them” (Milroy 1980: 
175). In work with Li on the topic of code-switching, Milroy writes: 


(A) network analysis can... form an important component in an integrated 
social theory of language choice. It links the community with the interac- 
tional level in focusing on everyday behavior of social actors. ... The link 
with the economic and sociopolitical level derives from the observation 
that networks seem to form not arbitrarily but in response to social and 
economic pressures. (Milroy & Li 1995: 155) 


While “density” refers to the intensity of contact among network members, 
there are distinctions in the quality of relationships between any two network 
members. A distinction between exchange and interactive networks was sug- 
gested by Milardo (1988), to which Milroy and Li add passive network ties: 


Exchange networks constitute persons such as kin and close friends with 
whom ego not only interacts routinely, but also exchanges direct aid, advice, 
criticism, and support - such ties may therefore be described as “strong”. 
Interactive networks on the other hand consist of persons with whom ego 
interacts frequently and perhaps over prolonged periods of time, but on 
whom ego does not rely for personal favours and other material or sym- 
bolic resources — such ties may be therefore described as “weak”. An ex- 
ample of an interactive tie would be that between a shop-owner and a 
customer. In addition to exchange and interactive ties, we identified a “pas- 
sive” type of network tie, which seemed particularly important to migrant 
families. Passive ties entail an absence of regular contact, but are valued 
by ego as a source of influence and moral support. Examples are physically 
distant relatives or friends. (Milroy & Li 1995: 138-139) 


The key point is that sociolinguistics and network analysis give us a valuable 
matrix in which a four-stroke diffusion engine operates, modulated as it is by 
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transmission biases (see especially Rogers 2003 for a rich review of cases and 
analyses of the diffusion of social innovation). 


3.5 Causal anatomy of transmission 


A causal explanation of linguistic reality must include the role of transmission 
biases in the diffusion of innovations in social networks. A good diachronic ac- 
count of language change must be explicit about the proximal causal anatomy of 
the process, operating in microgenetic, enchronic, and ontogenetic frames. Pre- 
vious work has usefully identified and described transmission biases, but one 
might ask: Why these biases? What other biases might we predict are possible? 
How many might there be? 

We can answer these questions with reference to the basic, proximal causal 
anatomy of social transmission. It is powered by a four-stroke engine, a causal 
chain in the enchronic frame, from exposure to representation to replication to 
material instantiation, back to exposure and round again. A transmission bias is 
any force that serves as a filter, pump, or transformer for this process, with effects 
on any of the links in the potentially open-ended chain of iterated practice. 

A next step is to see how well we can explain the known and understood bi- 
ases within this four-stroke engine framework, and to see what predictions can 
be made and tested. This should connect to research on the puzzle of how our 
species evolved the capacity for cumulative culture (Tomasello 1999), a capacity 
that is strongly pronounced in humans but weak if present at all in our closest 
relatives, the other apes (Herrmann et al. 2007). While we can readily assume 
that other animals are engaged in goal-directed courses of action, and that they 
select from among different means for fixed ends in both the social and material 
realms, their selection of means for ends is relatively less flexible than that of 
humans. What is the link to transmission biases? We might assume that a chim- 
panzee, say, will be guided in its selection of a behavioral strategy by a strong 
content bias, incorporating a basic min-max payoff logic: keep effort to a mini- 
mum while ensuring the desired outcome. But if its repertoire of strategies is, on 
the whole, not being acquired by learning from others - but, say, learned by rit- 
ualization during the course of life, in an ontogenetic frame - then transmission 
biases will have no traction. 
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When accounts of social-cultural transmission are explicit about the causal pro- 
cesses involved, they often take cultural items — rather than systems - as their 
unit of analysis. This works well but it is awkward because we know that cul- 
tural items don’t exist in isolation. We can only make sense of cultural items in 
the context of a system of cultural meaning. This brings us back to the puzzle, 
foreshadowed in Chapter 1, of causal units. 

Higher-level systems like languages and cultures show enormous coherence 
of structure, so much so that we are seduced into thinking of them as organisms 
with bodies (see classic statements of philologists von der Gabelentz 1891 and 
Meillet 1926: 16). Here is Gabelentz: 


Language is not a mere collection of words and forms, just as the organic 
body is not a mere collection of limbs and organs. Both are in any stage of 
their life (relatively) complete systems, dependent on themselves; all their 
parts are interdependent and each of their vital manifestations arises from 
this interaction. (von der Gabelentz 1891: 10) 


Compare this to the situation in vertebrate biology. Genes are distinct entities 
yet they “form alliances” thanks to the bodies and body plans in which they are 
instantiated (Gould 1977, cited in Dawkins 1982: 117). 


Every gene in a gene pool constitutes part of the environmental back- 
ground against which the other genes are naturally selected, so it’s no won- 
der that natural selection favors genes that “cooperate” in building these 
highly integrated and unified machines called organisms. Biologists are 
sharply divided between those for whom this logic is as clear as daylight, 
and those (even some very distinguished ones) who just do not understand 
it - who naively trot out the obvious cooperativeness of genes and unitari- 
ness of organisms as though they somehow counted against the “selfish 
gene” view of evolution. ... By analogy with coadapted gene complexes, 
memes, selected against the background of each other, “cooperate” in mu- 
tually supportive memeplexes. (Dawkins 1999: xv) 


4 The item/system problem 


Vertebrates have bodies while cultural systems do not. Still, the item/system 
link needs to be accounted for in both cases. With both bodies and memeplexes, 
sets of items somehow hold together as systems. But the causal forces are differ- 
ent. The pieces of a cultural system are not held together at any stage by physical 
attachment to a shared material whole. So this is our puzzle. If languages and 
other cultural systems hang together, what is the binding force? We have seen 
that cultural transmission involves causal processes that apply only to small parts 
of the larger whole. What explains the coherence of that larger whole? This is 
the item/system problem. 

Here is the solution. The ideas of cultural item and cultural system are recon- 
ciled by something that they have in common: Neither idea exists without the 
simpler idea of a functional relation. A word - kangaroo, for example - is eas- 
ily thought of as a distinct cultural item. You can cite it or borrow it without 
having to also cite or borrow the language system that it comes from. But the 
word cannot be defined or understood — nor can it exist — except in terms of its 
functional relation to other things, things like the words it co-occurs with, the 
conversations in which it is used for referring to kangaroos, and so on. The same 
is true for technology. A spoke can be designed, named, bought, and sold, but as 
a cultural item, a spoke doesn’t make sense without a wheel. And while a wheel 
is a whole when thought of with reference to a spoke, it is a part when thought 
of with reference to a vehicle, and so on. 

In sum: An item doesn’t make sense without functional relations to other 
things, just as a system doesn’t make sense without the functional relations that 
it contains. Functional relations are the interface that joins items and systems 
together. We can look to functional relations for a solution to the item/system 
problem. 


4.1 A transmission criterion 


In the causal ontology of culture, there is a transmission criterion. A social fact 
— by definition — would cease to exist if individual people stopped behaving as 
if it existed (Searle 2010). And social facts endure with relative stability beyond 
individual people’s lifetimes. Therefore, social facts must be transmitted among 
individuals in human populations in order to (i) exist and (ii) endure with relative 
stability. Transmission is a necessary part of what makes culture and language 
the way they are. 

A causal understanding of culture depends, then, on knowing how culture is 
transmitted within human groups and across generations. Much is known about 
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how items are transmitted (Rogers 2003), but macro-level cultural systems cannot 
be transmitted in the same way. 

Do we need two separate accounts of transmission, one for items, one for sys- 
tems? I am going to argue that we can derive system transmission from item 
transmission, on the condition that we have a more accurate definition of items. 
We can define items not as cultural things but as cultural things with functional 
relations to other cultural things. Cultural items are specified for — and advertize 
- their relations to the contexts into which they fit (where, it must be said, this 
fit can be quickly and easily re-tooled). As Kockelman (2013: 19) writes: “there 
are no isolated environments and organisms, there are only envorganisms. 


4.2 Defining properties of systems 


To understand what a cultural system is, begin with the idea of a cultural item. 
This is any seemingly detachable conceived entity such as a piece of technol- 
ogy, a technique, a way of saying something, a value. An item can be readily 
defined and labeled, and can be learned and borrowed from one human group 
into another (though typically with a change of meaning in the new context). 
Object-like things such as tomahawks might be prototypical items, but the idea 
of item intended here also includes train tracks, AC current, and mother-in-law 
avoidance. 

By contrast, a cultural system is a coherent set of such items, each item related 
to the others. A system has a holism that goes beyond the sum of the parts, in 
the sense that the full meaning of any individual cultural item is determined by 
how it functions in relation to other things in context. Often, we cannot observe 
the system directly or in one go, as for example in the case of a language or 
a telecommunications infrastructure, though this is sometimes made virtually 
possible by means of signs of these systems that scale them down in such a way 
as to produce a “tangible expression”, as Durkheim (1912: 208) put it, of the more 
diffuse phenomenon. 

A book can contain a grammatical description of a language. A diagram can 
portray the elements of a telecommunications system in miniature. In these cases 
a representation of the system is created or inferred from an aggregate of encoun- 
ters with context-situated items. These itemized emblems are different from the 
real systems they represent, and they have different collateral effects as a result 
of their form. A grammar book, for example, can be held up in one hand. This 
helps to promote the idea that a language is a finite, bounded thing; in short, an 
item. 
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As we now turn to examine systems in more detail let me emphasize that nei- 
ther items nor systems can be understood, nor indeed can they exist, without 
the relations that are inherent in both. Relations are definitive for both items and 
systems. If something is an item, relations define its functions. If it is a system, 
relations define its structure. 

A system should have at least these three properties: 


1. It can readily be construed as a thing with multiple inter-related parts. 
2. Effects on one part should have effects on other parts. 


3. The parts should together form a whole in the sense that they are more 
closely related to each other than they are to things outside the system. 


Good examples are biological or ecological systems. In a food chain, popula- 
tions of different species are inter-related. Changes in the frequency or behavior 
of one species will affect the frequency or behavior of others. While each species 
in the ecosystem will ultimately be connected to entities outside the focal food 
chain system, the integration within the system is greater. 

Clearly, on all three counts, whether or not we are looking at a system is ulti- 
mately a matter of construal. To say that some entities form a system is partly 
just a way of looking at those entities. 


4.3 Relations between relations 


Culture and language hinge on shared meaning, and so the systems we are in- 
terested in here are semiotic systems. The core idea of a semiotic system is well 
illustrated in Darwin’s account of the expression of emotion in animals. Darwin 
introduces a principle of functional connection between a sign and what it stands 
for. 

In his example, the visible features of a dog in a “hostile frame of mind” - 
upright, stiff posture, head forward, tail erect and rigid, bristling hairs, ears for- 
ward, fixed stare - are intelligible because they recognizably “follow from the 
dog’s intention to attack”. Figure 4.1 is Darwin’s illustration. 

These behaviors are functionally connected to the aggressive attitude, and so 
others may take them to signal that attitude. This can be illustrated as in Fig- 
ure 4.2. 

This is only a first step toward establishing a semiotic system. Figure 4.2 shows 
a relatively simple semiotic relation. There is a potential positive association be- 
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Figure 4.1: Darwin’s illustration of a dog in hostile frame of mind (Figure 5 from 
The Expression of the Emotions in Man and Animals). 


stiff posture, etc. 


stands for 


hostile frame 
of mind 


Figure 4.2: A “functional”, indexical association between observable behavior 
and frame of mind (after Darwin). 
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tween an observable behavior and a frame of mind. Whoever makes this asso- 
ciation might produce a number of relevant interpretants, for example running 
away, grabbing a big stick, or adopting an attacking posture. 

Darwin then argues for a second signalling principle, which he calls antithesis. 
The dog can exploit the already established semiotic relation shown in Figure 4.2 
to express the opposite of aggression. He does this by “reversing his whole bear- 
ing”, that is, doing the “opposite” of what he would do when aggressive. So, when 
approaching his master in an affectionate attitude, his visible behaviors will in- 
clude body down, flexuous movements, head up, lowered wagging tail, smooth 
hair, ears loosely back, loose hanging lips, eyes relaxed. Figure 4.3 is Darwin’s 
illustration. 


Figure 4.3: Darwin’s illustration of a dog in an affectionate attitude (Figure 6 from 
The Expression of the Emotions in Man and Animals). 


None of [these] movements, so clearly expressive of affection, is of the 
least direct service to the animal. They are explicable, as far as I can see, 
solely from being in complete opposition to the attitude and movements 
which are assumed when a dog intends to fight, and which consequently 
are expressive of anger. (Darwin 1872: 15-16) 


As depicted in Figure 4.4, antithesis is a secondary relation. It is a relation be- 
tween relations. As Darwin pointed out, this secondary relation is only possible 
if the interpreter has already recognized a primary functional relation. But there 
is something more that it depends on, something crucial to the idea of a semiotic 
system. It follows from the meaning of the term opposite. 
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‘opposite of’ 


. flexuous 
stiff posture, etc. TI 


movements, etc. 


stands for 
stands for 


hostile frame opposite of affectionate 
<< 


of mind frame of mind 


Figure 4.4: A secondary indexical association between observable behavior and 
frame of mind (at right), deriving its meaning only in connection with 
the established relation illustrated in Figure 4.2 (and incorporated at 
left of this Figure), assuming the interpreter’s knowledge of a limited 
range of possible bodily behaviors, on the one hand, and a limited set 
of frames of mind, on the other (after Darwin). 


To see that a certain behavior is “the opposite” of some other behavior, as op- 
posed to simply not that other behavior, you must be able to consider alternative 
possibilities within a restricted set. Flexuous movements can be recognized as 
the opposite of the aggression-signaling behavior only when one knows, or can 
predict, a limited range of postures that a dog can make. For this to work in the 
way depicted in Figure 4.4, you must also understand that there is a limited set 
of relevant frames of mind that the dog may have, with aggressive at one end 
and affectionate at the other. 

This type of semiotic system arises when Darwin’s principle of antithesis sets 
up relations between relations (Kockelman 2013: 12-17). This becomes possible 
when someone has access not just to what they are currently perceiving (e.g., a 
dog in a certain posture) but when the person also knows about other systems 
such as body posture and emotional state, with some sense of their elements and 
the logical-causal relations between them. A person should understand that if a 
dog is being affectionate it is necessarily not being aggressive, or that if its body 
is stiff it cannot also be flexuous. 

Central to the idea of a functional relation to context that I am outlining here are 
the concepts of incorporation and contextualization. These are defined in semiotic 
terms by Kockelman (2006: 29), as follows: 
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Incorporation. For any two semiotic processes, A and B, A will be said to 
incorporate B (and hence be an interpretant of it) if the sign of B relates 
to the sign of A as part-to-whole, and the object of B relates to the object 
of A as means-to-ends. For example, in the case of instruments (semiotic 
processes whose sign is an artificed entity and whose object is a function), 
a wheel incorporates a spoke. 


Contextualization. For any two semiotic processes, A and B, A will be said 
to contextualize B, if A is required to interpret B, or at least assists in in- 
terpreting B. For example, a hammer contextualizes a nail. And a sword 
contextualizes a sheath. That is, nails make no sense without the existence 
of hammers; and sheaths make no sense without the existence of swords. 


The concepts of incorporation and contextualization help us to define func- 
tional relations. They hold, for example, for the relations between a verb and 
a clause, a handle and a knife, a marriage rule and a kinship system. They ac- 
count for relations between concepts and the larger frames that contextualize 
them (Fillmore 1982). They are the basis of combinatoric rules, and as such they 
ultimately account for grammar in the complete sense (assuming a semantically- 
based approach to grammar; cf. Langacker 1987; Wierzbicka 1988; Croft 2000; 
Haspelmath 2007). 


4.4 More complex systems 


The basic relations-between-relations structure shown in Figure 4.4 combines 
with incorporation and contextualization - kinds of embedding relations — to 
yield the sorts of semiotic systems that make up any natural language (Saussure 
1916; see Dixon 2010, 2014, Bickel 2014). 

All languages have systems of form classes. The thousands of words (and other 
morphemes) that you have to learn in order to speak a language can be catego- 
rized according to how they are distributed relative to each other. There are 
open classes of content words like nouns and verbs (in most if not all languages) 
versus closed classes of function words like prepositions (e.g., in English) and 
case-marking affixes (e.g., in Finnish). 

Then there are constructional systems defined by principles of combination. 
An example is the system for describing motion events in Lao (Enfield 2007: 387- 
389). There are three consecutive slots. Each slot may be filled with a verb from 
three distinct sets. The first verb refers to the manner of motion (this is an open 
set). The second refers to the path of motion (from a set of 10 verbs). The third 
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refers to the direction of motion in relation to the deictic center (from a set of 3 
verbs). See Table 4.1. 


Table 4.1: Lao directional verb system 


Slot 1 Slot 2 Slot 3 

Verb of manner Verb of path Verb of direction 
(open class) (closed, n=10) (closed, n=3) 
léén1 ‘run’ khun5 ‘ascend’ paj3 ‘go’ 
ñaang1 ‘walk’ long2 ‘descend’ müa2 ‘return’ 
king4 ‘roll’ khaw5 ‘enter’ maa2 ‘come’ 
lùan1 ‘slide’ qook5 ‘exit’ 

tén4 ‘jump’ khaamö5 ‘cross.over’ 

166j2 ‘float’ 16614 ‘cross.under’ 

khiil ‘ride’ taam3 ‘follow’ 

khaan2 ‘craw? phaanl ‘pass’ 

taji ‘creep’ liap4 ‘go along edge’ 

com1 ‘sink’ qoom4 ‘go around’ 

doot5 ‘leap’ 

etc. 


Using this system, a Lao speaker can say things like this: 
(1) khaan2 qook5 paj3 
crawl exit go 


‘(S/he/it) crawled out/away. 


(2) doot5long2 maa2 
leap descend come 


“(S/he/it) leapt down here! 


(3) Joo phaan1 mua2 
float pass return 
‘(S/he/it) floated back past.’ 


This linguistic sub-system illustrates a fundamental intersection between two 
axes. A syntagmatic axis is the “left-to-right” axis along which separate elements 
combine. On a paradigmatic axis, each slot along the syntagmatic axis may be 
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filled by alternative members of a set, with contrast effects between possible 
values (not unlike the way a dog’s stiff posture is opposed to a flexuous posture). 

Sub-systems in language interact with each other and show dependencies in 
higher-level systems like those defined in comprehensive grammatical descrip- 
tions. Aikhenvald & Dixon (1998) describe dependencies among grammatical 
sub-systems. They point out, for example, that the system of polarity (positive 
versus negative in relation to a predicate or clause) puts constraints on other sub- 
systems in the grammars of many languages. For example, in Estonian, there is 
a system in which person and number are distinguished by morphological mark- 
ing on the verbs, but these distinctions are only realized in positive polarity. The 
distinctions are lost in the negative. See Table 4.2. 


Table 4.2: Verb ‘to be’ in Estonian 


POSITIVE NEGATIVE 


olen (1sc), oleme (1PL) 
oled (2sc), olete (2PL) ei ole (1/2/3sG/PL) 
on (3sG/PL) 


Aikhenvald & Dixon (1998) present a cross-linguistic hierarchy of dependen- 
cies between sub-systems like these. This kind of inter-connectedness between 
paradigm sets and combinatoric rules, and between sub-systems in a language, 
is evidence for the broad underlying system properties of linguistic behavior. 

It follows from these facts about linguistic systems that we cannot view any 
piece of language as a mere item. “A living language is not just a collection of au- 
tonomous parts”, say Donegan & Stampe (1983: 1). A language is “a harmonious 
and self-contained whole, massively resistant to change from without, which 
evolves according to an enigmatic, but unmistakably real, inner plan” (Donegan 
& Stampe 1983: 1). 

They illustrate their point in explaining how it is that the languages of two 
sides of the Austroasiatic language family - Munda and Mon-Khmer - show a list 
of typological distinctions that are “exactly opposite at every level of structure” 
(Donegan & Stampe 2002: 111) even though they are known to be descended from 
the same proto-language. Donegan and Stampe argue that speakers of Munda 
innovated a new prosodic profile, and when they did this they were tampering 
with something that “pervades every level of language structure” (Donegan & 
Stampe 1983: 14). A simple change from iambic to trochaic stress in words had 
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systemic knock-on effects that changed the entire morphosyntactic profile of the 
language. Table 4.3 is adapted from Donegan & Stampe (1983: 1-2).! 


Table 4.3: Properties of Munda and Mon-Khmer languages 


Munda Mon-Khmer 
Phrase accent Falling (initial) Rising (final) 
Word order Variable-SOV, Rigid-SVO, 
AN, Postpositional NA, Prepositional 
Syntax Case, verb agreement Analytic 
Word canon Trochaic, dactylic Iambic, monosyllabic 
Morphology Agglutinative, suffixing, Fusional, prefixing 
polysynthetic or isolating 
Timing Isosyllabic, isomoric Isoaccentual 
Syllable canon (C)V(C) Unaccented (C)V, 


Consonantism 


Tone/register 


Vocalism 


Stable, 
geminate clusters 


Level tone (Korku only) 


Stable, monophthongal, 
harmonic 


accented (C)(C)V(G)(C) 
Shifting, tonogenetic, 
non-geminate clusters 


Contour tone/register 


Shifting, diphthongal, 


reductive 


As the examples discussed here show, there are good reasons to believe that 
languages have higher-level system properties. Yet there is no single causal event 
in which a language as a whole system is transmitted, at least not in the same 


sense as the single causal event of sexual reproduction by which a full set of 
genetic information is transmitted in vertebrates. Below, I return to the trans- 


mission problem. But first, I want to broaden the scope and show that the point 
I have just made for language also holds for social and cultural systems. 


1 Donegan and Stampe of course considered the possibility that language contact explains the 
data in Table 4.3. Their goal was to argue against a contact account, with their knock-on effect 


idea being offered as an alternative. Whether they are right remains an open question. Neither 


contact nor internal development can be treated as a null hypothesis. Proponents of both 
arguments are obliged to make their case. 
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As an illustration of the system concept in another domain of culture, consider 
sections and subsections in Aboriginal Australia (Radcliffe-Brown 1931). In a sec- 
tion system, all members of a community belong in one of four categories. Each 
category has a name in the local language (e.g., in the Alyawarre language of 
Central Australia they are Kngwarriya, Upurla, Pitjarra and Kimarra). For conve- 
nience we can label them A, B, C, and D. 

As McConvell (1985: 2) describes it, in a four-term section system “a man of A 
marries preferentially a woman of B; their children are D. A man of B marries a 
woman of A; their children are C. C and D similarly marry each other, and their 
children are A if the mother is C and B if the mother is D”. After two genera- 
tions of this, one ends up in the same section as one’s father’s father or mother’s 
mother. See Figure 4.5. 


A = B 
C = D 
marriage = 


matriline —> 
patriline => 


Figure 4.5: Sections (Northern Australia), from McConvell (1985: 32), after 
Radcliffe-Brown (1931). 


McConvell also describes the doubly complex subsection systems. In a sub- 
section system, the four categories of what used to be a section system are each 
divided in two (see McConvell for diagram and discussion). There are structural 
consequences. For example, a cross-cousin is a possible wife in a section system, 
but not in a subsection system. 

These kinds of system are widespread in Aboriginal Australia. They are shared 
by groups that have completely different languages. Evans (2012) compares the 
situation to that of the modern system of military ranks as officially standardized 
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by the Geneva Convention: groups in the same culture area have direct transla- 
tions for the same offices in what is essentially the same system. In Northern 
Australia, a common cultural context has facilitated the widespread and stable 
status of particular types of kinship systems and vocabularies. 

But there are many aspects of culture that seem less like systems and more 
like items. Eckert (2008) gives the example of a cut of jeans that happens to 
be fashionable among high school kids one year, though she urges us not to be 
tempted by the apparent individuability of such cultural elements. Something 
like the wearing of pegged pants or a way of pronouncing a vowel is always 
situated in an indexical field, as she puts it. When things like these are borrowed 
or adopted into new social settings they may be segmented out from a historical 
and indexical constellation of signs and meanings. 

People who do this segmenting may be unaware of the larger (especially his- 
torical) connections. They will nevertheless give the item a place in a new system. 
Parry & Bloch (1989) make this point in connection with the historical adoption 
of money around the world: “in order to understand the way in which money is 
viewed it is vitally important to understand the cultural matrix into which it is 
incorporated” (Parry & Bloch 1989: 1). 

Sahlins (1999) says that when new elements - everything from money to snow- 
mobiles — are incorporated into cultural contexts, they are adopted for local pur- 
poses and given a “structural position” in “the cultural totality”. Sahlins cele- 
brates the appropriation by neotraditional people of elements from other people 
(and note we can distinguish between processes of appropriation that alter the 
item so as to make it fit into the receiving system versus those that alter the 
system so as to fit the incoming item; usually it is a combination of the two). 

Sahlins is criticizing the idea that cultures like the Yupik become contaminated 
when people borrow modern innovations. His point is that once the items in 
question are borrowed, they are changed. They have new meanings in their new 
contexts. 


4.5 Are cultural totalities illusory? 


Consider the kinds of systems and relations of incorporation in language and 
culture just discussed. They show that we are never dealing with detached cul- 
tural items. But it does not follow from the striking systematicity of Australian 
sections and subsections that these ramp up into cultural totalities. It’s possi- 
ble that they do. After all, ethnographers have succeeded in writing reference 
descriptions of the knowledge, practices, values, and technologies of defined so- 
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cial/cultural groups (Radcliffe-Brown 1922; Malinowski 1922; Firth 1936; Evans- 
Pritchard 1940; Fortes 1945). In the same way, linguists have succeeded in describ- 
ing languages as totalities, not in the way a layperson might discretely label an 
imagined language - Dutch, Flemish, Thai, Lao, etc. - but rather in the technical 
sense of listing the full vocabulary and set of grammatical rules that any speaker 
in a community should know. 

What is our evidence that such totalities exist? Both the “whole systems” and 
the “parts” of language seem clearly identifiable at first, but both ideas crumble 
upon close inspection (Le Page & Tabouret-Keller 1985; Hudson 1996). Any lin- 
guist knows that “a language” - in the sense of a community-wide system like 
French or Korean - is impossible to define by pointing at it: “as a totality it is 
inaccessible and indefinable; each of us has only partial experience of it” (Le Page 
& Tabouret-Keller 1985: 191). 

“A language” in the sense that we normally mean it constitutes a system insofar 
as it is a set of interrelated items, such as words, each of which appears to be 
a stand-alone unit or element. The system idea is especially clear in the case 
of language for at least three reasons. First, the set of interrelated items in a 
language is a very large set. Second, we have strong intuitions about what is part 
of language and what is not. Third, this set contains numerous sub-systems. But 
still we never encounter a language as such, only fragments of languages, items 
like words and grammatical constructions, in contexts of speech and writing. 

In their masterpiece on the nature of language, Le Page & Tabouret-Keller 
(1985: 8-9) challenge us to face the problem of “how to know when to speak 
of separate systems”: 


If we start from the concept of an underlying system this becomes an ex- 
tremely difficult, if not insoluble, problem; if however we approach it from 
the point of view of the degree of coherence evidenced in the behavior of 
a group of individuals, the problem is seen to be one of relationships and 
of stereotypes inherent in each individual. 


Metalinguistic stances are real. But this does not mean that the systems those 
stances point to are real in the same way. How, then, can we have a clear causal 
account of linguistic systems? The answer - to bring us back to the item/system 
problem - is in the causality of social behavior at the micro level. 
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Do cultural totalities exist? As members of a group we may feel certain that there 
is a cultural totality around us. But we never directly observe it. As Fortes (1949: 
56) put it: 


Structure is not immediately visible in the “concrete reality”. It is discov- 
ered by comparison, induction and analysis based on a sample of actual 
social happenings in which the institution, organization, usage etc. with 
which we are concerned appears in a variety of contexts. (Fortes 1949: 56) 


This mode of discovery is not only used by ethnographers who are studying 
culture. It is also used by children whose task is to become competent adults (see 
Brown & Gaskins 2014). 

If our experience of culture is in the micro, how do we extrapolate to the 
macro? When Parry & Bloch wrote about money and its status, they stressed 
that there are local differences between cultures and the effects on the meaning 
that money comes to have. But they also acknowledged a certain unity across 
cultures: 


[This unity is] neither in the meanings attributed to money nor in the moral 
evaluation of particular types of exchange, but rather in the way the total- 
ity of transactions form a general pattern which is part of the reproduction 
of social and ideological systems concerned with a time-scale far longer 
than the individual human life. (Parry & Bloch 1989: 1) 


In terms that apply more generally to the micro/macro issue, there is “some- 
thing very general about the relationship between the transient individual and 
the enduring social order which transcends the individual” (Parry & Bloch 1989: 
2). It brings to mind Adam Smith’s (1776: book 4, ch. 2) discussion of the re- 
lation between the motivations of individuals and the not-necessarily-intended 
community-level aggregate effects of their behavior (Schelling 1978; Hedström 
& Swedberg 1998; Rogers 2003). Parry & Bloch (1989: 29) contrast “short-term 
order” with “long-term reproduction”, and they suggest that the two must be 
linked. 


5 The micro/macro solution 


This brings us back to the transmission criterion, an idea that will help to bridge 
the micro/macro divide. If a person is to function as a member of a social group, 
he or she needs to individually construct, in the ontogenetic frame, the ability to 
produce and properly interpret the normative behavior of others. 

Not even a cultural totality is exempt from the transmission criterion. Individ- 
ual people have to learn the component parts of a totality during their lifetimes 
(in ontogeny), and they must be motivated to reproduce the behaviors (in mi- 
crogeny and enchrony) that stabilize the totality and cause it to endure beyond 
their own lives and lifetimes (in diachrony). A person’s motivation can be in the 
form of a salient external pressure such as the threat of state violence. But it 
usually comes from the less visible force of normative accountability (Heritage 
1984; Enfield 2013). 

In the social/cultural contexts of our daily lives, everything we do will be inter- 
preted as meaningful. “The big question is not whether actors understand each 
other or not”, wrote Garfinkel (1952: 367). “The fact is that they do understand 
each other, that they will understand each other, but the catch is that they will 
understand each other regardless of how they would be understood.” This means 
that if you are a member of a social group, you are not exempt from having oth- 
ers take your actions to have meanings, whether or not these were the meanings 
you wanted your actions to have. 

As Levinson (1983: 321) phrases it, also echoing Goffman and Sacks, we are 
“not so much constrained by rules or sanctions, as caught up in a web of infer- 
ences’. We will be held to account for others” interpretations of our behavior 
and we know this whether we like it or not.! This is a powerful force in getting 
us to conform. Accountability to norms “constitutes the foundation of socially 
organized conduct as a self-producing environment of ‘perceivedly normal’ ac- 
tivities” (Heritage 1984: 119). The thing that tells us what counts as normal is of 
course the culture. 


With respect to the production of normatively appropriate conduct, all that 
is required is that the actors have, and attribute to one another, a reflexive 
awareness of the normative accountability of their actions. For actors who, 
under these conditions, calculate the consequences of their actions in re- 
flexively transforming the circumstances and relationships in which they 
find themselves, will routinely find that their interests are well served by 


1 This does not mean that we are accountable for just any interpretation, but only those inter- 
pretations that are grounded in social norms. For example, if you are in the habit of going 
barefoot on the street, you can expect people to draw attention to this whether you like it or 
not (in a way that they will not if you are in the habit of wearing shoes). 
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normatively appropriate conduct. With respect to the anarchy of interests, 
the choice is not between normatively organized co-operative conduct and 
the disorganized pursuit of interests. Rather, normative accountability is 
the “grid” by reference to which whatever is done will become visible and 
assessable. (Heritage 1984: 117) 


One might ask what is “normatively appropriate conduct”. The answer must 
include any of the kinds of behaviors discussed in the above section on cultural 
systems: for example, behaving in accordance with the rules of a section system 
by marrying someone of the right category (or being able to give reasons why 
you have done otherwise). They would not be cultural behaviors if they were not 
regimented in a community by accountability to norms (and probably also laws). 

So the path that is both the least resistant and the most empowering for a per- 
son is to learn the system that generates a shared set of normative interpretations 
of people’s behavior, and then go with the flow. This is how the totality cannot 
exist without the individuals, while - paradoxically — appearing to do just that. 
We create and maintain the very systems that constrain us. 

The close relationship between short-term order and long-term reproduction is 
an asymmetrical one. Short-term order is where the causal locus of transmission 
is found. It is where acceleration, deceleration, and transformation in cultural 
transmission occurs (Schelling 1978; Sperber 1985, 1996; Rogers 2003). 

From all of this it is clear that cultural systems exist and they both constrain us 
and guide us. The question is: How are systems transmitted? The regulation of 
individual behavior in the cultural totality is not achieved by mere emergence. It 
is not like the self-oriented behavior of a bird in the seemingly concerted move- 
ment of a flock. Individuals’ behavior is regulated by norms, in an effectively 
telic way. A good deal of cultural regimentation is done through explicit instruc- 
tion, often with reference to norms, and sometimes with reference to punishable 
laws. 

To see how whole cultural systems are transmitted, we have to draw on item- 
based processes of transmission. As we saw in the last chapter, the only good 
causal account we have for social transmission through populations and across 
generations is one that works in terms of items, not whole systems. 


5.1 The combinatoric nature of cultural items in general 


Recall that the context bias is grounded in the fact that one cannot behold any so- 
called item without beholding it in relation to something else, including not only 
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things of similar kinds, but also the social norms and intentions associated with 
items and the contexts in which they appear. So, I cannot know what a hammer 
is if I do not see it in relation to the human body, timber and nails, people’s 
intentions to build things, conventional techniques for construction, and so on. 

These relations - which themselves are interrelated — form an indispensible 
part of what I am referring to by the term item. When a cultural item diffuses, 
what is diffusing is something less like an object and more like a combinatoric 
relation. So, a hammer incorporates a handle or grip. The handle or grip has a 
combinatoric relation to the human hand insofar as the handle and the hand are 
practically and normatively designed to go together. The handle is designed that 
way because of how the human hand is. The handle only makes sense in terms 
of a person’s hand. 

This going together of the handle and the human hand is like a grammatical 
rule. In a similar way, the handle of the hammer and the head of the hammer 
go together both practically and normatively. The head of the hammer, in turn, 
goes with a nail. The nail, in turn, goes with timber, and so forth. So we see 
how the cultural items that diffuse in communities necessarily incorporate - and 
advertize — their rules of fit with other items. 

The sprawling yet structured systems that we call languages have the same 
kinds of properties of incorporation and contextualization that I have just de- 
scribed for concrete objects. So, if speakers of a language have borrowed a word 
from another language, this does not mean they have merely adopted a pairing 
of sound and concept. They must also have adopted a way of relating the word 
to their existing language system (whether or not this relation resembles the one 
used in the source system). 

The word will not be usable if it does not have combinatoric properties that 
specify how it fits with other words. The norms for combining the word in usage 
may be borrowed along with the word itself, or they may be provided by exist- 
ing structures in the borrowing language, or they may even be innovated in the 
process of incorporation. 

The combinatoric relations surrounding a cultural item do not have to diffuse 
along with that item. But a cultural element must have some combinatoric rela- 
tion to other cultural items in the same domain if it is to function and circulate. 
That relation can just as well be invented by the people who adopt the item, in 
line with the contraints of their own culture and world view. This is the point 
that authors like Sahlins and Eckert, mentioned above, have stressed for culture. 

So, structuralist linguists like Donegan & Stampe (1983: 1) are right when they 
say that a language “is not just a collection of autonomous parts”. But this does 
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not mean that a language is “a self-contained whole”. The same applies when 
cultural anthropologists refer to the “cultural totality”. 

We never encounter whole systems except one fragment at a time, in mi- 
crogeny and enchrony. Our “partial experience” (Le Page & Tabouret-Keller 1985: 
191) is not experience ofthe whole system. But nor is it experience of stand-alone 
items. When we experience culture, we experience meaningful items in relations 
of functional incorporation and contextualization with other such items. 

Each such relation is, effectively, a combinatoric principle, like a norm for 
forming a grammatical sentence or for using a hammer and nail in the appropri- 
ate way. These relations are at the center of the framework being proposed here. 
These relations are what is transmitted. They have an inherent connection to a 
cultural system or field, but this system or field has no pre-given size or outer 
borderline. 

Bloch (2000) says that old critiques of diffusionism in anthropology also work 
as critiques of today’s item-based accounts. I would say that the problems are 
handled by the simple conceptual shift being proposed here. The relevant unit of 
cultural transmission (meme or whatever) is not a piece. The relevant unit is a 
piece and its functional relation to a context. This might seem obvious. But when 
we make it explicit, the fear of a disembodied view of cultural units goes away. 
The required conceptual move is not to take items and put them in a context. 
Their relation to a context is what defines them. 


5.2 Solving the item/system problem in language 


Identifying the relation to context as the common unit of analysis of both items 
and systems is necessary but not yet sufficient. We need an account of how 
this scales up into large structured sets of such relations. Let us consider the 
question in connection to language. Every linguistic convention in a community 
is a product of general mechanisms of social diffusion. Each convention has its 
own history. Every word, every morpheme, every construction has followed its 
own historical path to community-level acceptance. As Bloomfield (1933: 444) 
said, “individual forms may have had very different adventures”. 

This does not mean languages are mere bundles of items. They are large, struc- 
tured, systematic wholes. Psychologically, languages exist in people’s minds and 
bodies. They take the form of idiolects. Intersubjectively, languages exist at a 
community level to the extent that people’s idiolects are effectively alike in struc- 
ture and content, as demonstrated by the evidently tolerable degree of success of 
communication (Enfield 2015). We can now specify some forces that bring items 
together and structure them into systems. 
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5.3 Centripetal and systematizing forces 


When we say that two people speak the same language, we mean that two in- 
dividuals’ knowledge of a language system - synchronically, as can be seen in 
their enchronic and microgenetic behavior - is effectively (though never exactly) 
shared. This sharedness exists because a large number of the same linguistic vari- 
ants have been channelled, in a huge set, along the same historical pathways. 
This gives the impression that a language is passed down as a whole, transcend- 
ing lifetime after lifetime of the individuals who learn and embody the system. 

This is the point made by Thomason & Kaufman (1988): Normal social condi- 
tions enable children, as first language learners, to construct idiolects that effec- 
tively match the idiolects of the people they learn from - i.e., those with whom 
children share a household and an immediate social environment, and who are, 
incidentally, most likely to share their genes. Normal transmission is what al- 
lows historical linguists to abstract from the fact that each linguistic variant has 
its own career, and in turn to treat the whole language as having one spatial- 
historical trajectory. 

In many cases this is a reasonable and successful methodological presumption 
(Haspelmath 2004). But in situations other than those of normal transmission 
(Le Page & Tabouret-Keller 1985; Thomason & Kaufman 1988), linguistic items 
do not always travel together, but may follow separate paths, making visible 
what is always true but usually obscured by items’ common destiny in practice, 
namely: Each item has its own history. 

Genealogical continuity in language change is typically taken to be the norm. 
Whenever we see that linguistic systems are permeable, for instance in certain 
language contact situations where the components of languages are prised apart, 
special explanations are demanded. 


5.4 On normal transmission 


To say that a child inherits a language from her parents is a misleading represen- 
tation of what happens in language acquisition. The idiolect of the child is not 
acquired like DNA in a bundle. Patterns of constituency and grammatical rela- 
tions do not unfold in children like the shapes of their bodily organs. Through 
practice, children have to learn, construct, and maintain skills and ideas for ways 
of saying things. 


The “rules” of a child’s “native language” ... are in any case likely to be ten- 
tative hypotheses, easily modified by fresh semantic needs, fresh contacts, 
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fresh analogies. “Syntax” in the grammarian’s sense is what emerges from 
this process, not what it starts from. (Le Page & Tabouret-Keller 1985: 190) 


Logic and universal grammar, then, are targets towards which, rather than 
the starting point from which, human linguistic activity proceeds. The 
origins of that activity are like those of a game which gradually develops 
among players, each of whom can experiment with changes ofthe rules, all 
of whom are umpires judging whether new rules are acceptable. (Le Page 
& Tabouret-Keller 1985: 197) 


This transmission takes place through air, over days, weeks, months, years, 
with interference and noise. Every bit of the idiolect’s structure has to be passed 
over and constructed from scratch by the learner. This task is made possible by 
the sheer deluge of linguistic data - a Niagara of words, as Hayakawa (1978: 12) 
called it - which people are exposed to, and produce in turn. Child language 
acquisition is a process of building (Tomasello 2003), resulting in something like 
a grammatical totality in the child’s competence. But whatever totality a person 
has built, it is instantiated somehow in the head and so (a) will never go public 
as a whole and (b) will be destroyed when the person dies. The system is neither 
observed nor passed on as a whole unit, only ever fragment-by-fragment. 

Dunbar (1996) has hypothesized that prelinguistic human ancestors created 
language as a way to lessen time pressure due to the need to manage an expand- 
ing number of social associates. Sustaining a social network by means of lin- 
guistic contact is time-consuming. Where personal exchange or strong network 
ties are involved, we are necessarily oriented towards a limited group. The size 
of networks is constrained by the time it takes to maintain these relationships. 
However, the number of non-personal exposure ties — passive seeing and hear- 
ing, especially due to media and high population density — is potentially massive. 
The invention of writing has drastically changed the proportion of personal and 
non-personal sources of exposure to innovation. 

Thomason & Kaufman (1988) invoke an idea of normal transmission (see above). 
They define normal transmission “by exclusion” (Thomason & Kaufman 1988: 10), 
in terms of how “perfectly” all sub-systems of a language are reproduced in chil- 
dren’s idiolects. In normal transmission, linguistic input from outgroup people 
has negligible impact on a child’s construction of an idiolect highly convergent 
with the idiolects of the parents’ generation. Normal transmission, in Thomason 
and Kaufman’s sense, is a social fact (Thomason & Kaufman 1988: 12), though it 
is defined by formal facts about child language acquisition in a community: 
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[A] claim of genetic relationship [between a “parent” and a “daughter” lan- 
guage] entails systematic correspondences in all parts of the language be- 
cause that is what results from normal transmission: what is transmitted 
is an entire language - that is, a complex set of interrelated lexical, phono- 
logical, morphosyntactic, and semantic structures. (Thomason & Kaufman 
1988: 11) 


Here is how I understand Thomason and Kaufman’s point. To say that a “ge- 
netic” relationship holds between parent and daughter languages is to use a 
metaphor, and to use this metaphor is harmless as long as the older generation’s 
idiolects are reproduced so closely in the idiolects of the younger generation 
that it is as if the new idiolects were replicas of the old. This is effectively what 
happens in the case of normal transmission. There is a relentless and focussed 
linguistic sign deluge from people of the learner’s own group. 

But another question remains. How can we explain the relative impermeabil- 
ity of linguistic systems in circumstances of normal transmission? Stability in 
conventional systems is no less in need of explanation than variation or change 
(Bourdieu 1977; Sperber 1996; Sperber & Hirschfeld 2004). What are the forces 
that cause linguistic variants to follow en masse a single path of diffusion and 
circulation, and to hold together as structured systems? Let us briefly consider 
three such forces. 


5.4.1 Sociometric closure 


A first centripetal force is sociometric closure. This arises from a trade-off be- 
tween strength and number of relationship ties in a social network. If a person 
is going to maintain a social relationship, she has to commit a certain amount of 
time to this. Time is a finite resource. This puts a structural constraint on the pos- 
sible number of relationships one can maintain (Hill & Dunbar 2003). The result 
is a relatively closed circulation of currency within a social economy of linguis- 
tic items. It causes people’s inventories of items (i.e., their vocabularies, etc.) to 
overlap significantly, or to be effectively identical, within social networks. 

This helps to account for how people who interact often can have a common 
set of variants. It does not account for the system-like nature of the relations 
among those items. We turn now to two forces of systematization inherent to 
grammar, in the paradigmatic and syntagmatic axes. 
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5.4.2 Trade-off effects 


One systematizing force comes from functional trade-off effects that arise when 
a goal-oriented person has alternative means to similar ends. When different 
items come to be used in a single functional domain, those items can become 
formally and structurally affected by their relative status in the set. This happens 
because the items compete for a single resource, namely, our selection of them 
as means for our communicative ends. 

When Zipf (1949) undertook “a study of human speech as a set of tools”, he 
compared the words of a language with the tools in an artisan’s workshop. Dif- 
ferent items have different functions, and different relative functional loads. In 
a vocabulary, Zipf (1949: 21) argued, there is an internal economy of words, with 
trade-offs that result in system effects like the observed correlation between the 
length of a word (relative to other words) and the frequency of use ofthe word 
(relative to that of other words). 

Zipf reasoned that “the more frequent tools will tend to be the lighter, smaller, 
older, more versatile tools, and also the tools that are more thoroughly integrated 
with the action of other tools” (Zipf 1949: 73). He showed that the more we regard 
a set of available means as alternatives to each other in a functional domain, the 
more they become defined in terms of each other, acquiring new characteristics 
as a result of their role in the economy they operate in. In other words: The more 
we treat a set of items as a system, the more it becomes a system. 


5.4.3 Item-utterance fit, aka content-frame fit 


A final key source of grammatical structure is grammatical structure itself. The 
utterance is a core structural locus in language. An utterance is a local context 
for the interpretation of a linguistic item. It is an essential ratchet between item 
and system. As Kirby writes, although “semantic information” is what linguistic 
utterances most obviously convey, “there is another kind of information that 
can be conveyed by any linguistic production, and that is information about the 
linguistic system itself”. 


When I produce the sentence “these berries are good” I may be propagating 
cultural information about the edibility of items in the environment via 
the content of the sentence. At the same time I may also be propagating 
information about the construction of sentences in my language. (Kirby 
2013: 123) 


59 


5 The micro/macro solution 


In this way, an utterance is a frame and a vehicle for replicating linguistic 
variants (Croft 2000). 

Item-utterance fit is the structural fit between diffusible types of linguistic 
items and the token utterances in which they appear. It is an instance of the 
more general content-frame schema (Levelt 1989) also observed in phonology 
(MacNeilage 1998; see Enfield 2013: 54-55), and a case of the “functional rela- 
tion to context” defined above as acommon property of items and systems. Now 
we see that it is not just acommon property. It is the very property that connects 
items with systems. An utterance is an incorporating and contextualizing frame 
for the diffusion of replicable linguistic items, and it is a frame for the diffusion 
of the combinatoric rules from which the higher-level system is built. 


5.5 A solution to the item/system problem? 


The above considerations suggest that the item/system problem can be solved if 
the following three forces apply in the biased transmission of cultural items: 


1. Congregation: Items are brought together and “bundled” by the population- 
level effects of inward-directed sociometric biases. 


2. Specialization: Items then effectively compete for selection in the same 
functional contexts, and come to be specialized as alternative means for 
related functional ends. 


3. Combination: Items in a set come to combine with each other in functional 
ways, via context biases and the relation of item-utterance fit. 


We can expect there to be analogous relations to item-utterance fit (=content- 
frame fit) in the domain of culture. Think, for instance, of systems of social re- 
lations in kinship, or systems of material culture and technology in households 
and villages. 

Zipf’s (1949) analogy is useful here. For his “economy of tools-for-jobs and jobs- 
for-tools” to get off the ground, one first needs a workshop, somewhere the set of 
tools is assembled in one place, and made accessible to a person with goals. In 
language and culture, this is achieved by sociometric closure (§ 5.4.1, above): the 
more you talk with certain people, the more ways of talking you will share with 
these people. Then, one works with the set of tools, using them as alternative 
specialized means to similar or related ends (§ 5.4.2, above). Finally, these tools 
will, whether by design or by nature, enter into the relations of incorporation 
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and contextualization that define their both their functional potential and their 
system status (§ 5.4.3, above). 

Now this should look familiar to the linguist. Once we get an inventory or 
lexicon of items that have specalized functions within a given domain, they will 
naturally enter into the paradigmatic and syntagmatic relations that define semi- 
otic systems in the classical sense. 


61 


6 Conclusion 


Ever since Darwin’s earliest remarks on the uncanny similarity between lan- 
guage change and natural history in biology, there has been a persistent con- 
ceptual unclarity in evolutionary approaches to cultural change. This unclarity 
concerns the units of analysis. 

In some cases the unit is said to be the language system as a whole. A language, 
then, is “like a species” (Darwin 1871: 60; cf. Mufwene 2001: 192-194). If so, then 
we are talking about a population of idiolects that is coterminous with a popula- 
tion of bodies (allowing, of course, that in the typical situation - multilingualism 
- one body houses more than one linguistic system). 

On another view, the unit of analysis is any unit that forms part of a language, 
such as a word or a piece of grammar. "A struggle for life is constantly going 
on amongst the words and grammatical forms in each language” (Müller 1870, 
cited in Darwin 1871: 60). By contrast with the idea of populations of idiolects, 
this suggests that there are populations of items (akin to Zipf’s economy of word- 
tools), where these items are produced, and perceived, in the context of spoken 
utterances. 

While some of us instinctively think first in terms of items, and others of us 
first in terms of systems, we do not have the luxury of ignoring either. Neither an 
item nor a system can exist without the other. The challenge is to characterize 
the relation between the two. This relation is the one thing that defines them 
both. 

The issue is not just the relative status of items and systems but the causal 
relations between them. If the distinction between item and system is a matter 
of framing, it is no less consequential for that. We not only have to define the 
differences between item phenomena and system phenomena, we must know 
which ones we are talking about and when. And we must show whether, and if 
so how, we can translate statements about one into statements about the other. 


6 Conclusion 


6.1 Natural causes of language 


“We might gain considerable insight into the mainsprings of human behavior”, 
wrote Zipf (1949: v), “if we viewed it purely as a natural phenomenon like every- 
thing else in the universe”. This does not mean that we cannot embrace the an- 
thropocentrism, subjectivity, and self-reflexivity of human affairs. It does mean 
that underneath all of that, our analyses remain accountable to natural, causal 
claims. In this book we have developed a causally explicit model for the trans- 
mission of cultural items, and we have approached a solution to the item/system 
problem that builds solely on these item-based biases. I submit that the biases 
required for item evolution — never forgetting that “item” here really means 
“something-and-its-functional-relation-to-a-context” — are sufficient not only to 
account for how and why certain cultural items win or lose. They also account 
for the key relational forces that link items with systems. 

We have confronted the item/system problem. To solve it, we reached for the 
most tangible known causal mechanism for the existence of linguistic and cul- 
tural reality: item-based transmission. The outcome is this. With the right defini- 
tion of “item” — as always having a functional relation to context - we can have 
an item-based account for linguistic and cultural reality that gives us a system 
ontology for free. 


6.2 Toward a framework 


Why do neighboring languages share structures in common? In earlier work on 
language contact, maintenance, and change (Enfield 2003, 2005, 2008, 2011), I 
considered some of the challenges that this question raises. This led me to con- 
front the conceptual problems I have discussed in this book. They are problems 
of causality. What makes languages the way they are? What causes a language 
to have certain features and not others? How permeable are language systems? 
These questions led me to look for a causal account of the ontology of language. 
I have tried in the above chapters to present some of the ideas that came out. 
Together, these ideas suggest a natural, causal framework for understanding the 
foundations of language. The framework has two conceptual components: 


Causal frames: There are multiple frames or “time-scales” within which 
change in linguistic and other cultural systems can be causally effected. 
While most approaches work within just one or two of these frames, all 
of these frames should be considered together, with special attention to 


64 


6.2 Toward a framework 


the links between them. As explicated in Chapter 2, the framework recog- 
nizes six such frames, under the rubric of MOPEDS: microgenetic, invok- 
ing cognitive and motoric processes for producing and comprehending lan- 
guage and other goal-directed behavior; ontogenetic, invoking lifespan pro- 
cesses by which people, usually as children, acquire linguistic and cultural 
knowledge and skills; phylogenetic, invoking ways in which the requisite 
cognitive capacities have evolved in our species; enchronic, invoking the 
sequential interlocking of social actions in linguistic clothing; diachronic, 
invoking historical change, conducted socially in human populations; and 
synchronic, any approach, such as linguistic or ethnographic description, 
that does not explicitly invoke notions of process. 


Transmission biases: A socially- and cognitively-grounded account of the 
genesis, diffusion, and conventionalization of innovations in human pop- 
ulations must provide a causal basis for how it is that social conventions 
— such as the linguistic and ethnographic facts that we observe - are the 
way they are. As explicated in Chapter 3, the causal machinery for dif- 
fusion of types of behavior (including language) within a population is a 
driving force - an engine of sorts - with four linked loci: exposure to a bit 
of behavior, representation of that bit of behavior, subsequent reproduction 
of that bit of behavior, and material instantiation of some trace of the be- 
havior (leading to exposure of others, feeding back into the process anew). 
Each locus is a site where the chain of diffusion may be broken, reinforced, 
or transformed: Such breaks, reinforcements, and transformations come 
from biases that may operate on each locus (Chapter 3 gives the details). 
There are many of these biases. Some are cognitive. For example, if a lin- 
guistic construction is easier to learn, it will diffuse better. Some are social. 
For example, if more prestigious people model an innovation, other people 
are more likely to copy it. 


These two conceptual pillars of a framework for understanding the natural 


causes of language should be enough to provide the raw materials for explaining 
the ontology of linguistic systems. 
Linguistic system ontology is a puzzle because items (in contexts) are the only 


things that circulate and yet somehow systems exist. If our conceptual frame- 


work recognizes multiple coexisting causal frames and multiple coexisting loci 


of transmission, it becomes possible to see how gaps and interfaces between 
these frames and loci provide the traction for system emergence. At least it be- 
comes possible to study the problem. Empirical and theoretical investigations 
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will have to draw not only on the linguistics of descriptive grammar, semantics, 
pragmatics, and typology, but also on sociological research on innovation dif- 
fusion, sociolinguistic research on social networks, and the natural science of 
cultural evolution. A framework like this should allow us to be maximally ex- 
plicit about the causal processes that create linguistic and other cultural facts. 
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Natural causes of language 


What causes a language to be the way it is? Some features are universal, 
some are inherited, others are borrowed, and yet others are internally in- 
novated. But no matter where a bit of language is from, it will only exist 
if it has been diffused and kept in circulation through social interaction 
in the history of a community. This book makes the case that a proper 
understanding of the ontology of language systems has to be grounded in 
the causal mechanisms by which linguistic items are socially transmitted, 
in communicative contexts. A biased transmission model provides a basis 
for understanding why certain things and not others are likely to develop, 
spread, and stick in languages. 

Because bits of language are always parts of systems, we also need to 
show how it is that items of knowledge and behavior become structured 
wholes. The book argues that to achieve this, we need to see how causal 
processes apply in multiple frames or “time scales” simultaneously, and we 
need to understand and address each and all of these frames in our work 
on language. This forces us to confront implications that are not always 
comfortable: for example, that “a language” is not a real thing but a conve- 
nient fiction, that language-internal and language-external processes have 
a lot in common, and that tree diagrams are poor conceptual tools for un- 
derstanding the history of languages. By exploring avenues for clear so- 
lutions to these problems, this book suggests a conceptual framework for 
ultimately explaining, in causal terms, what languages are like and why 
they are like that. 
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